PROGRESS TO DATE ON COMPUTING REGRESSION BASED ESTIMATES
OF CLIMATIC CHANGES FOLLOWING VOLCANIC ERUPTIONS


John  Bart  Wilburn 
US Army Electronic Proving Ground
Fort Huachuca, Arizona 85613


ABSTRACT 

Report and invite comments on: problem addressed, method of analysis, and results to date.

The intent of the project is to produce regression-based estimates of seasonal
temperatures and precipitation at several locations by performing a multivariate
analysis of tree ring data from selected sites in North America and a subsequent
multivariate regression of the tree ring data against meteorological data.




PURPOSE:


The purpose of this project is to detect tree growth anomalies following volcanic
eruptions by analyzing tree ring growth patterns, and to use modern meteorological data with
coincident tree ring data to develop transfer functions for reconstructing climate anomalies
following volcanic eruptions.

These climate anomaly patterns can be compared with other derived paleoclimate
anomalies for further understanding of the environment.

PROCEDURE:

The procedure involves several steps of analysis. The first is the analysis of the tree ring
data to detect statistically significant responses of tree sites to volcanic eruptions. The
volcanic eruption data were selected from H. H. Lamb (1969), Volcanic Dust in the Atmosphere (1).
The tree ring data were selected from Schulman (1956) (2) and were restricted to Douglas
fir trees with good intercorrelation, high sensitivity, and sufficient length of sample to
incorporate most of the volcanic data. The tree ring data were selected from ten sites (fig. 1)
spanning a significant portion of the western North American continent
(29°N - 52°N, 105°W - 121°W) so as to obtain a good sample of a large-scale climatic condition.

These tree ring data (percent of normal growth) were then arranged into a 14-year lagged
array. That is, the first column is years 1 (referenced to the beginning of the chronology) to
14, the second column is years 2 to 15, and so on to the last column of years M-13 to M for a
chronology of M years. This array is referred to as the Total Ring Data array. From this
array, for each site, a subset referred to as the Ring Signal Data array was extracted.
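
The lagged arrangement described above can be sketched as follows. This is only an illustrative sketch: the function name `lagged_array` and the 20-year sample chronology are assumptions made for the example, not part of the original software.

```python
def lagged_array(chronology, lag=14):
    """Return the lagged array as a list of `lag` rows.

    Column j holds years j .. j + lag - 1 of the chronology (0-based),
    so row i is the growth index i years after the first year of its column.
    """
    m = len(chronology)
    if m < lag:
        raise ValueError("chronology is shorter than the lag window")
    n_cols = m - lag + 1  # M - 13 columns when lag is 14
    return [[chronology[j + i] for j in range(n_cols)] for i in range(lag)]


# Hypothetical 20-year chronology in percent of normal growth.
ring_index = [100 + k for k in range(20)]
total = lagged_array(ring_index, lag=14)
print(len(total), len(total[0]))  # number of rows, number of columns
```

Each column is one 14-year window, so a chronology of M years yields M-13 columns.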

A second subset is created by implication of the first. That second subset is the remainder
of the Total Ring Data and is referred to as the Background Ring Data array. These arrays
are denoted by: $D^t_{NM}$, Total Ring array; $D^s_{NP}$, Ring Signal array; and $D^b_{NR}$,
Background array.

The $D^s$ are picked from $D^t$ in the following manner. The volcanic eruptions are
parameterized by date of eruption in years, location in latitude and longitude, and
magnitude of eruption denoted by a dust veil index (d.v.i.) devised by H. H. Lamb (pp.
471-473).

A class of eruptions is specified by bounds on these parameters. The dates of the
eruptions within these bounds are translated to column numbers of $D^t$. The columns of
$D^t$ selected in this manner are extracted and comprise the array $D^s$ of N rows, with
the number of columns determined by the number of eruptions in the specified class.
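
The selection of signal and background columns by eruption class can be sketched as below. The record layout (`year`, `dvi`, `lat`, `lon_w`), the function names, and all numbers are hypothetical placeholders chosen for the example. They are not values from Lamb's tables or from the original program.

```python
def in_class(eruption, dvi, lat, lon_w):
    """True when an eruption record lies inside all three pairs of bounds."""
    return (dvi[0] <= eruption["dvi"] <= dvi[1]
            and lat[0] <= eruption["lat"] <= lat[1]
            and lon_w[0] <= eruption["lon_w"] <= lon_w[1])


def split_columns(total, first_year, eruptions, dvi, lat, lon_w):
    """Split a lagged array into signal columns and background columns."""
    n_cols = len(total[0])
    picked = sorted({e["year"] - first_year for e in eruptions
                     if in_class(e, dvi, lat, lon_w)
                     and 0 <= e["year"] - first_year < n_cols})
    rest = [j for j in range(n_cols) if j not in picked]
    signal = [[row[j] for j in picked] for row in total]
    background = [[row[j] for j in rest] for row in total]
    return signal, background


# Hypothetical 3-row array whose first column starts in the year 1700.
total = [[10 * i + j for j in range(6)] for i in range(3)]
eruptions = [
    {"year": 1701, "dvi": 1000, "lat": 40, "lon_w": 100},  # inside the class
    {"year": 1703, "dvi": 100, "lat": 40, "lon_w": 100},   # magnitude too small
    {"year": 1704, "dvi": 800, "lat": -10, "lon_w": 100},  # latitude out of bounds
    {"year": 1710, "dvi": 900, "lat": 50, "lon_w": 90},    # beyond the last column
]
signal, background = split_columns(total, 1700, eruptions,
                                   dvi=(500, 5000), lat=(20, 90), lon_w=(0, 135))
print(signal)
```

The background array is simply every column that was not picked, which mirrors the "created by implication" wording of the text.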

The test for significant responses is a two-fold test. First, a chi-square test was
performed as follows. A chi-square test was performed on the row averages of $D^s$
against the hypothesis of being indistinguishable from the row averages of (a) $D^t$ and (b)
$D^b$. At the same time, a chi-square test was performed on the row averages of $D^b$
against the hypothesis of being distinguishable from the row averages of (c) $D^t$, and a
chi-square test on the row averages of $D^t$ against the hypothesis of being
distinguishable from the average of the total tree ring chronology ($\bar{d}$). If all hypotheses
are rejected, that is, if the probability of $D^s$ being a chance variation of $D^t$ or $D^b$ is low
while at the same time the probability of $D^b$ being a chance variation of $D^t$ is high and the
rows of $D^t$ are all chance variations of the total ring average, then the set $D^s$ is labeled as a
candidate for the second test.

An example of this first test is seen below (fig. 2). The error terms are standard
deviations. The example picked is the tree ring chronology from the Fraser River Basin.
The volcanic criteria were: magnitude 500 - 5000 d.v.i., latitude 20°N - 90°N, longitude 0°
to 135°W.


The second test involved an eigenvector comparison. The software which built the arrays
$D^t$, $D^s$ and $D^b$ and computed the chi-square test was extended to perform a correlation
matrix calculation and an eigenvector extraction. An example of the printout is seen in
figure 3 for the correlation matrix $C^t_{NN}$ of $D^t$, the eigenvectors $E^t$,
and the eigenvalues $\Lambda^t$. This computation was performed for correlation matrices and
their associated sets of eigenvectors/eigenvalues, for variances about the row averages of each
of the arrays. That is, the data for $D^t$, $D^s$ and $D^b$ were normalized with respect to their
own row averages.
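
A minimal sketch of the row-wise correlation and eigenvector extraction follows, assuming NumPy is available. The function name `row_modes` and the random demonstration array are assumptions for illustration. The original software is not reproduced here.

```python
import numpy as np


def row_modes(array):
    """Eigenvalues and eigenvectors of the row-by-row correlation matrix.

    Each row is treated as one variable, so every row is normalized about
    its own average before the rows are correlated with one another.
    """
    data = np.asarray(array, dtype=float)
    corr = np.corrcoef(data)                 # rows are the variables
    values, vectors = np.linalg.eigh(corr)   # eigh returns ascending eigenvalues
    order = np.argsort(values)[::-1]         # reorder to descending variance
    return values[order], vectors[:, order]


# Hypothetical 14-row array standing in for a lagged ring array.
rng = np.random.default_rng(0)
demo = rng.normal(100.0, 20.0, size=(14, 200))
values, vectors = row_modes(demo)
print(values[:3] / values.sum())  # share of variance in the first three modes
```

Because a correlation matrix has a unit diagonal, the eigenvalues sum to the number of rows, which makes each eigenvalue directly readable as a share of the total variance.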


There were some interesting developments from these eigenvectors, as seen in figures 4, 5,
and 6. These vectors are from $D^t$, the total ring array. Each eigenvector appears to be a
composite of sinusoids of increasing complexity, the first and second vectors being
predominantly half waves of a fundamental, with complexity increasing from there on. The
explanation of this behavior is not settled as yet.


Some comments on what is being done, as an aid in interpretation, are due here. The
matrices, $D^t$ as well as the others, are correlated by rows.

That is, we are looking at the correlation of a pattern of growth beginning in one year and
running sequentially with a pattern of growth beginning in another year and running
sequentially. In short, we have a type of autocorrelation. In this context, however, we might
explain it as the correlation of a growth sequence with any set of previous growth
conditions of each element of the sequence. The eigenvectors depict the relative
contribution of the respective rows to the total variance of $D^t$ accounted for, as indicated
by the relative magnitude of their associated eigenvalues, or the mode of variance associated
with that eigenvalue.

The notion of the mode of variance in years following the year of the first row is
particularly useful when we are interpreting the average growth, $\bar{d}^s$, and eigenvectors of the
Ring Signal array $D^s$. This is because now we are talking about modes of variance in years
following an eruption in a specific class of volcanic eruptions.
This brings us to the second test for significance: the comparison of eigenvectors. The
Ring Signal arrays $D^s$ were analyzed a second time. This time, the row averages of $D^b$ were
subtracted from the elements of identical rows of $D^s$, and then a covariance matrix was
computed from the new array $D^s$ and the eigenvectors extracted from the covariance
matrix. To ensure that we were not comparing unrelated quantities, the eigenvectors of the
covariance matrices of $D^t$, $D^s$ and $D^b$ were recomputed using the row averages of
their respective arrays, as used previously for the correlation matrix computation. The
eigenvectors computed from the covariance matrix were nearly identical with those from
the correlation matrix. This was to be expected, since the variances of the row variables are
nearly the same.

The rationale behind this move was the following. The eigenvectors extracted from the
correlation matrix of $D^s$ using its own row averages comprise a description of the modes of
variance about the response signal of the trees to volcanoes, if there is one; whereas the
eigenvectors extracted from the covariance matrix of $D^s$ using the row averages of $D^b$
comprise a description of the variance of the response signal of the trees about the
background signal of tree growth. Based on this reasoning, if the two sets of eigenvectors are
nearly the same, then the array $D^s$ is labeled as a type I error and rejected. There are more
rigorous statistical techniques for comparing the eigenvectors (4), but the situation here does
not seem to warrant that degree of rigor. If the two sets of eigenvectors are significantly
different, then the array $D^s$ is labeled as a significant response signal to volcanoes and the
eigenvectors of the covariance matrix are considered as the modes of variance of the
response. Note that because the variance is indicated by the square of the eigenvector
component, a mirror image is considered as the same mode.
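
The mirror-image rule can be made concrete with a sign-insensitive similarity measure. The sketch below is an assumption introduced for illustration: the text compares eigenvectors by inspection and does not state a numerical criterion, and `mode_similarity` is a hypothetical helper name.

```python
import math


def mode_similarity(u, v):
    """Absolute cosine between two eigenvectors.

    A value of 1.0 means the vectors describe the same mode, including the
    case where one is the mirror image (sign reversal) of the other.
    """
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return abs(dot) / norm


same_mode = mode_similarity([1.0, 2.0, 3.0], [-1.0, -2.0, -3.0])
different_mode = mode_similarity([1.0, 0.0], [0.0, 1.0])
print(same_mode, different_mode)
```

Taking the absolute value is what makes a vector and its mirror image count as one mode. Any threshold for "nearly the same" would have to be chosen by the analyst.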

An example of this comparison is seen in figure 7, which shows the average growth
($\bar{d}^s$) of the Fraser River chronology for 14 years following an eruption specified by the
class 500 - 5000 d.v.i., 0° - 135°W long., 20° - 90°N lat., and the eigenvector, Ev, extracted
from the covariance matrix and the eigenvector, Ev, extracted from the correlation matrix.

The results of these tests were the selection of four sites, one with two cases, making a
total of five cases. The sites and their cases were:

Site                       Magnitude (d.v.i.)   Latitude      Longitude
Fraser River Basin         500 - 5000           20° - 90°N    0° - 135°W
Saskatchewan River Basin   500 - 5000           20° - 90°N    0° - 135°W
Missouri River Basin       500 - 5000           20° - 90°N    0° - 135°W
Big Bend                   500 - 5000           20° - 90°N    0° - 135°W
Big Bend                   0 - 500              ±20°N         ±180°W

Note that with four of the cases, a latitudinal dependence may be investigated.

Their average growth curves ($\bar{d}^s$) and their first eigenvectors (Ev) are shown in figures
8 and 9 respectively. There does not seem to be anything consistent in the growth curves,
but the eigenvectors indicate a definite similarity of modes of variance between the Fraser
River and Saskatchewan River chronologies and between the Missouri River and Big Bend
chronologies. One must remember that the growth curves depict the result of change from a
previous set of initial conditions of growth and climate, whereas the eigenvectors depict the
mode, or mechanism, of that change.

METHOD 

The next part of the project was to use these chronologies from the four sites to
estimate the seasonal temperature and precipitation at or near the tree sites during the 14
years following the eruptions. Because of the nature of tree growth physiology, the
seasonal data were referenced to the preceding year: for example, precipitation during the
winter season preceding the year of the tree growth (8). The seasons were divided into: (a)
preceding year ending 30 May; (b) preceding summer, consisting of the months June, July,
August and September; (c) preceding winter, consisting of the months October, November,
January, and February; and (d) preceding spring, consisting of the months March, April, and
May.


The regression-based estimate was performed by a regression analysis technique
referred to as "Principal Component Regression Analysis." It is described in detail in a
paper (5) to be published separately and is included as an appendix in this technical report.

The essence of the principal component regression analysis is that it allows the physical
phenomena, considered as a system, to be partitioned into independent and orthogonal
modes of variance, or principal components, and then allows only those modes of variance
of the regressand phenomena which correlate well with all of the allowed modes of variance
of the regressor phenomena to be used in the estimate of the regressand. This technique
further allows a selective reduction of error in the regressand estimate.

All of the properties mentioned above are consequences of the orthogonality and
independence of the principal components of the original data.

Quantitatively, the regression rationale is as follows, in brief outline. We have a set of
tree ring data $D^t_{NM}$ from which a complete set $D'_{NU}$ can be selected which matches,
chronologically, a set of meteorological data $F_{NU}$, also formatted into a lagged array. The
meteorological data are from a station at or near the tree site. From these two sets of data are
computed the correlation matrices $C^d_{NN}$ and $C^f_{NN}$, and subsequently the eigenvector
sets are extracted satisfying the equations

$$C^d_{NN} E_{NN} = E_{NN} \Lambda_{NN}$$
$$C^f_{NN} G_{NN} = G_{NN} \Omega_{NN}$$

The unitary transformation of D and F into their respective principal components is
performed by:

$$X_{NU} = E^{T}_{NN} D'_{NU}$$
$$Y_{NU} = G^{T}_{NN} F_{NU} \qquad (1)$$

Now then, from $X_{NU}$ we select a specified number of components q accounting for the
amount of variance requested. This is determined from the fact that the
amount of variance accounted for by the i-th principal component, $X_{iU}$, is given by

$$V(X_{iU}) = \frac{\lambda_i}{\mathrm{tr}\,\Lambda_{NN}}$$

Thus we have a set $X_{qU}$ accounting for a specified amount of variance given by

$$V(X_{qU}) = \frac{\sum_{i=1}^{q} \lambda_i}{\mathrm{tr}\,\Lambda_{NN}}$$
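
The choice of q for a requested share of variance can be sketched as follows. The function name `components_for_variance` and the four eigenvalues are hypothetical and serve only to illustrate the cumulative ratio of eigenvalues to the trace.

```python
def components_for_variance(eigenvalues, fraction):
    """Smallest q whose leading eigenvalues reach the requested variance share."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)  # trace of the eigenvalue matrix
    running = 0.0
    for q, value in enumerate(ordered, start=1):
        running += value
        # Small tolerance so that an exact match is not lost to rounding.
        if running / total >= fraction - 1e-12:
            return q
    return len(ordered)


# Hypothetical eigenvalues of a 4-variable system.
eigenvalues = [5.0, 3.0, 1.5, 0.5]
q_by_request = {f: components_for_variance(eigenvalues, f)
                for f in (0.80, 0.90, 0.95, 1.00)}
print(q_by_request)
```

With these illustrative eigenvalues, requests of 80, 90, 95, and 100 percent select 2, 3, 3, and 4 components respectively.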


A set of regression equations $\beta_{Nq}$ is calculated such that we have a regression model of

$$Y_{NU} = \beta_{Nq} X_{qU} + \epsilon_{NU} \qquad (2)$$


It is worth noting that because the $X_{qU}$ are all independent, the q coefficients of the
N-th equation are completely independent. Also, the multiple correlation coefficients,
$R^2_N$, are unambiguous because the joint confidence region of the regression equation is
unambiguous. In any case, recalling the transformation (1), (2) can be restated as

$$F_{NP} = G_{NN}\, \beta_{Nq}\, E^{T}_{qN}\, D^s_{NP} \qquad (3)$$


Those equations $\beta_{Nq}$ which fail an F-test against the hypothesis $C\beta = 0$ are set to zero.
This amounts to a kind of stepwise regression, except that the variables rejected are those
modes of variance of the system of F which have an insignificant statistical relationship with
any combination of all of the modes of variance of the system of D.


The confidence bounds (80%) of the estimate $F_{NP}$, indicated by $\delta F_{NP}$, are
computed from the confidence bounds of $\beta_{Nq}$ indicated by $V^K_{qq}$; K = 1, N. The
computation is performed by (cf. 5)

$$\delta F_{NP} = G_{NN} \left\{ t(P-q-1,\ 1-\alpha/2) \cdot \left( D^{sT}_{PN}\, E_{Nq}\, V^{K}_{qq}\, E^{T}_{qN}\, D^{s}_{NP} \right)^{1/2} \right\} \qquad (4)$$

Summarizing, the estimation of $F_{NP}$ from $D^s_{NP}$ can be performed by the transfer function

$$T_{NN} = G_{NN}\, \beta_{Nq}\, E^{T}_{qN}$$

and the calculation of the confidence bound of the estimate $F_{NP}$ can be performed by an
operator function of $\delta F_{NP}$ on $D^s_{NP}$ as indicated in (4).
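
The transfer function summarized above can be sketched numerically as below, assuming NumPy is available. The function name `pcr_transfer`, the random input, and the choice to standardize each row before analysis are assumptions for illustration. The F-test rejection step and the confidence bounds of (4) are deliberately left out.

```python
import numpy as np


def pcr_transfer(d, f, q):
    """Return T = G beta E_q^T for row-standardized regressor D and regressand F."""

    def standardize(a):
        a = np.asarray(a, dtype=float)
        a = a - a.mean(axis=1, keepdims=True)
        return a / a.std(axis=1, ddof=1, keepdims=True)

    def modes(a):
        corr = a @ a.T / (a.shape[1] - 1)
        values, vectors = np.linalg.eigh(corr)
        order = np.argsort(values)[::-1]      # descending variance
        return values[order], vectors[:, order]

    d, f = standardize(d), standardize(f)
    lam, e = modes(d)
    _, g = modes(f)
    x = e[:, :q].T @ d                        # q leading regressor components
    y = g.T @ f                               # all regressand components
    # Least squares with orthogonal X: X X^T = (U - 1) * diag(lam[:q]).
    beta = (y @ x.T) / (lam[:q] * (d.shape[1] - 1))
    return g @ beta @ e[:, :q].T


# Sanity check on hypothetical data: regressing a data set on itself with
# every component retained should give the identity transfer function.
rng = np.random.default_rng(2)
d = rng.normal(size=(4, 50))
t_full = pcr_transfer(d, d, q=4)
t_two = pcr_transfer(d, d, q=2)
print(np.round(t_full, 6))
```

Retaining only two components in the same check gives a rank-2 projection instead of the identity, which shows how dropping modes restricts the estimate to the retained variance.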

This analysis was implemented in the manner indicated by the flow chart shown in
figure 10.

The results to date are for data selected from the Fraser River chronology for the class
of eruptions specified by 500 - 5000 d.v.i., 0° - 135°W long., 20° - 90°N lat. The
meteorological data tested were the pre-summer precipitation from Kamloops, British Columbia,
Canada.

The regression-based estimates of the pre-summer precipitation in Kamloops, Canada
were made by using the best estimates for each of the 14 years selected from the regressions
specifying 80 percent, 90 percent, 95 percent and 100 percent of the variance of the tree
ring data system and accepting the regression equations which pass the 90 percent
confidence F-test.

Figure 11 (Plate 10) of Appendix A is an example printout of the principal component
regression computer program run on a CDC 6500 for the case of 80 percent variance
requested. Note that the program computes the estimate twice: once before the F-test rejection
and then again incorporating the F-test rejection.


Figure 12 illustrates the estimates of the pre-summer precipitation. These estimates are
the composite of the best results of all four cases (80%, 90%, 95% and 100% of tree ring
data variance).
Figure 12 (Continued)


Regression-based estimates of summer precipitation in Kamloops, Canada during: (a)
1593-1566, (b) 1600-1613, (c) 1624-1637, (d) 1659-1672, (e) 1720-1733, (f) 1754-1767,
(g) 1765-1778, (h) 1782-1798, (i) 1844-1857, (j) 1865-1878.

Figure 13 is the plot of the average of the regression-based estimates of the pre-summer
precipitation. This average is taken over the ten chronologies for each of the 14 years
following the eruption. The error bars are the root-mean-square of the errors of each of the
ten values in the average.
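
The averaging and error-bar rule for one lag year can be sketched as follows. The function name and the numbers are hypothetical, and the sketch reads "root-mean-square of the errors" literally as the square root of the mean squared error of the values being averaged.

```python
import math


def average_with_rms_error(estimates, errors):
    """Mean of the estimates and the root-mean-square of their errors."""
    n = len(estimates)
    mean = sum(estimates) / n
    rms = math.sqrt(sum(err * err for err in errors) / n)
    return mean, rms


# Hypothetical estimates and confidence half-widths for a single lag year.
mean, rms = average_with_rms_error([1.0, 2.0, 3.0, 6.0], [3.0, 4.0, 0.0, 0.0])
print(mean, rms)
```

Applying the same function to each of the 14 lag years in turn would give one averaged curve with one error bar per year.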

Figure 14 compares the curve plotted in figure 13, re-normalized, to the normalized
average growth, $\bar{d}^s$, of the Fraser River Basin tree ring chronology. Note that since the
precipitation estimates are of the summer preceding the ring growth index, only 13 values
are plotted. The striking feature of this plot is that the curves seem to have a high
correlation. It is, in fact, 0.68, which seems to imply that the assumption that the tree
growth in any one year is dependent on the precipitation in the summer preceding the
growing season, rather than on the summer of the current growing season, is not strictly true.
In fact, the dependence is on both, and when one considers trees in the northern latitudes,
the dependence on precipitation of the current growing season increases. This can be tested
by repeating the experiment using summer precipitation from the current growing season
and then seeing which regression produces estimates with the highest precision. However, due
to the sampling nature of the decomposition of the data systems into principal components,
the components which heavily weight the first row of $D'_{NU}$ will not correlate highly with
the similar component of $Y_{NU}$. For that reason, when one deals with a lagged array, a
mistaken assumption on time coincidence does not cause a complete miss in the regression
analysis.

The curves of figures 12 and 13 are interpretable as follows. The curve in figure 13 is a
general estimate of the summer precipitation following an eruption of a large volcano,
whereas the curves of figure 12 are specific estimates. The estimates are given by year with
90% confidence bounds. As one can see, some of the estimates have confidence bounds so
large as to constitute essentially no estimate at all. Within the confidence bounds calculated
for each point, the curve in figure 13 agrees with most of the curves in figure 12.
ABSTRACT: Development of the mathematical rationale of multivariate regression
between sets of principal components with a demonstration of a computer program
implementing the rationale.


The intent of this paper is to propose a user-oriented method of multivariate linear
regression which will reduce the uncertainty of the user by eliminating the unwanted effects
of intercorrelation of variables and will enable the user to eliminate unnecessary variables
with predictable results.

Procedure: 

The situation is that of two sets of data: regressor data, $D_{NM}$, of N variables and
M measurements, and regressand data, $F_{NM}$, also of N variables and M measurements. In
general, the sets D and F will not be of the same number of variables, but for purposes of
development, they will be considered the same without any loss of generality.

The  user  supposes  that  he  has  two  systems,  D and  F,  adequately  described  by  the 
variables  in  each.  The  user  further  acknowledges  that  the  systems  are  very  likely  noisy  and 
that  he  has  observed  them  long  enough  to  have  a representative  sample  of  the  variance  in 
each  and  also  that  normality  can  be  assumed.  Having  satisfied  these  assumptions,  the  user 
may  proceed  as  follows: 

First, compute the variance/covariance matrices of the two normalized data sets

$$C^{D}_{NN} = (M-1)^{-1}\, D_{NM} D^{T}_{MN}$$

and

$$C^{F}_{NN} = (M-1)^{-1}\, F_{NM} F^{T}_{MN}$$


Next, perform the eigenvalue/eigenvector calculations

$$C^{D}_{NN} E_{NN} = E_{NN} \Lambda_{NN}$$
$$C^{F}_{NN} G_{NN} = G_{NN} \Omega_{NN}$$

where E and G are orthonormal sets, e.g.

$$\sum_{i} e_{ij}^{2} = 1 \quad \text{and} \quad \sum_{i} g_{ij}^{2} = 1 .$$
The eigenvector sets are used to compute the principal components by the unitary
transformations

$$X_{NM} = E^{T}_{NN} D_{NM}$$
$$Y_{NM} = G^{T}_{NN} F_{NM}$$

Users from the physical sciences will recognize this as analogous to a principal axis
transformation.

Each of these principal component sets consists of vectors which are independent and
orthogonal. Furthermore, each of the vectors represents a specific "mode" of variance of the
system which is independent of and orthogonal to the other N - 1 modes. The time-independent
modes themselves are represented by the eigenvectors associated with the principal component
in question, indicating the relative contribution of each of the original N variables in D or F to
that mode.

The following operation demonstrates the orthogonality properties of the principal
components and also one other very useful property.

We will use the set X, but the same applies to Y. Compute the variance/covariance
matrix of the principal components

$$C^{X}_{NN} = (M-1)^{-1}\, X_{NM} X^{T}_{MN} = E^{T}_{NN}\, C^{D}_{NN}\, E_{NN}$$

which reduces to

$$C^{X}_{NN} = \Lambda_{NN}$$
Thus, the principal components are orthogonal, and now we can see that the variance of each
of the principal components is given by the eigenvalue associated with the eigenvector used
to compute that particular component. Viewed in this way, the total variance of the original
data, $D_{NM}$, is partitioned by the eigenvectors $E_{NN}$, with the relative amount of the variance
accounted for by the i-th principal component given by

$$\text{Relative Var.}(X_{iM}) = \frac{\lambda_i}{\mathrm{tr}\,\Lambda_{NN}}$$
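
The two properties just stated can be checked numerically: the principal components are mutually orthogonal, and each has a variance equal to its eigenvalue. The sketch below assumes NumPy and uses randomly generated data with an artificially introduced correlation. All names and values are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n, m = 5, 400                       # N variables, M measurements
d = rng.normal(size=(n, m))
d[1] += 0.8 * d[0]                  # make two of the variables correlated
d -= d.mean(axis=1, keepdims=True)  # normalize each variable (row)
d /= d.std(axis=1, ddof=1, keepdims=True)

c = d @ d.T / (m - 1)               # correlation matrix of D
lam, e = np.linalg.eigh(c)          # eigenvalues and eigenvectors
x = e.T @ d                         # principal components X = E^T D
cx = x @ x.T / (m - 1)              # covariance matrix of the components

relative = lam / lam.sum()          # relative variance of each component
print(np.round(relative, 3))
```

The matrix `cx` comes out diagonal, with the eigenvalues on its diagonal, which is the reduction described in the text.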

It should be noted here that if the variables in D and F are of the same units and variance,
the correlation matrices C computed from the normalized data can be replaced by the
covariance matrix computed from unnormalized data. This may appeal to some users.
However, under those conditions of equal variance and units, this writer's experience has
been that the eigenvectors are very nearly the same as those from the correlation matrix. It
is when the variances are not the same that the sampling properties of the eigenvectors differ
depending on whether they are extracted from the correlation matrix or from the
covariance matrix. It is my feeling that the correlation matrix is best for general use.

These properties, orthogonality, independence, and the partitioning of the variance, will
be seen to be very useful in the following development of the regression, postulating the
model of

$$Y_{NM} = \beta_{NN} X_{NM} + \epsilon_{NM} .$$

From the above comments, we now know that both the $X_{NM}$ and $Y_{NM}$ are distributed
according to $X_{NM} \sim N(0, \Lambda_{NN})$ and $Y_{NM} \sim N(0, \Omega_{NN})$.

Thus, the estimate of $\beta_{NN}$ is found by

$$\beta_{NN} = Y_{NM} X^{T}_{MN} \left( X_{NM} X^{T}_{MN} \right)^{-1}$$

which reduces to

$$\beta_{NN} = (M-1)^{-1}\, Y_{NM} X^{T}_{MN}\, \Lambda^{-1}_{NN} .$$

We can incorporate the factor $(M-1)^{-1}$ into the relationship by setting

$$\Lambda^{*}_{NN} = (M-1)\, \Lambda_{NN}$$

and similarly

$$\Omega^{*}_{NN} = (M-1)\, \Omega_{NN}$$


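The matrix of the residual sum of squares $\Sigma_{NN}$ of the regression estimated by the
maximum likelihood method is estimated by $\hat{\Sigma}_{NN}$, computed as follows

$$M \hat{\Sigma}_{NN} = (Y_{NM} - \beta_{NN} X_{NM})(Y_{NM} - \beta_{NN} X_{NM})^{T}$$

from which follows

$$M \hat{\Sigma}_{NN} = Y_{NM} Y^{T}_{MN} - \beta_{NN} X_{NM} Y^{T}_{MN} - Y_{NM} X^{T}_{MN} \beta^{T}_{NN} + \beta_{NN} X_{NM} X^{T}_{MN} \beta^{T}_{NN}$$

this reduces to

$$M \hat{\Sigma}_{NN} = \Omega^{*}_{NN} - \beta_{NN} \Lambda^{*}_{NN} \beta^{T}_{NN}$$

An unbiased estimate of $\Sigma_{NN}$ is given by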


T * m ft  r> 
From the above formulation, we can identify the total sum of squares as the diagonal matrix
$\Omega^{*}_{NN}$ and the sum of squares due to regression as $\beta_{NN} \Lambda^{*}_{NN} \beta^{T}_{NN}$.

GEOMETRIC INTERPRETATION:

The matrix $\beta_{NN} \Lambda^{*}_{NN} \beta^{T}_{NN}$ is the matrix of the vector products of the
regression-based estimates.

This can be seen as

$$\hat{Y}_{NM} \hat{Y}^{T}_{MN} = \beta_{NN} X_{NM} X^{T}_{MN} \beta^{T}_{NN}$$

where $X_{NM} X^{T}_{MN}$ is identified as $\Lambda^{*}_{NN}$. Viewed in the geometrical context, the
diagonal terms of $\beta_{NN} \Lambda^{*}_{NN} \beta^{T}_{NN}$ are the squared lengths of the vectors
$\hat{y}_i$. The off-diagonal terms are the vector products $\hat{y}_i \cdot \hat{y}_j$, $i \neq j$.
Thus

$$\hat{y}_i \cdot \hat{y}_j = |\hat{y}_i|\, |\hat{y}_j| \cos\theta .$$

Thus the off-diagonal terms may be negative if $\cos\theta < 0$. However, since the vectors $Y_{NM}$
are ideally orthogonal, a nonzero $\cos\theta$ is an error. By this argument, it is of no great
consequence that the off-diagonal terms of $\Sigma_{NN}$ may be negative in the computation.

While interpreting the regression geometrically, consider $\beta_{NN} X_{NM}$ as the projection into
X space of $Y_{NM}$ in Y space. Then $|\beta X| / |Y|$ is the cosine of the angle between $Y_{NM}$ and
its projection $\beta_{NN} X_{NM}$. Graphically, this would appear as follows, considered in two
dimensions.

[Figure: the vector Y, its projection $\beta X$ into X space, and the angle $\theta$ between them]

where

$$|Y| \cos\theta = |\beta X| .$$

Thus

$$\cos^{2}\theta = \frac{|\beta X|^{2}}{|Y|^{2}} .$$

Since the vectors $Y_{NM}$ are not, in general, completely orthogonal, and since the matrix
$\Omega^{*}_{NN}$ is a diagonal matrix, the off-diagonal elements of $R^{2}_{NN}$ may be negative. However,
we are actually only concerned with the diagonal elements; therefore, we can compute

$$R^{2}_{NN} = \left( \beta_{NN} \Lambda^{*}_{NN} \beta^{T}_{NN}\, \Omega^{*-1}_{NN} \right) \cdot \delta_{NN}$$

The quantity $R^{2}_{NN}$ is interpretable as the square of the multiple correlation coefficients of
the regression equations $\beta_{NN}$.

An F-test against the hypothesis $C\beta = 0$ can be provided from $R^{2}$ by

$$F = \frac{R^{2}}{1 - R^{2}} \cdot \frac{M - N}{N - 1} \sim F_{N-1,\ M-N}$$

which is equivalent to the ratio of the sum of squares due to regression to the residual sum
of squares, scaled by the same ratio of degrees of freedom.

Note that for the off-diagonal elements, the F-ratio is negative.
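
The F-ratio for a single regression equation can be sketched as below. The function name `f_ratio` and the example values are assumptions. The degrees of freedom follow the (M - N)/(N - 1) scaling as read from the text, and the comparison against a tabulated critical value is left to the user.

```python
def f_ratio(r_squared, m, n):
    """F-ratio from a squared multiple correlation coefficient.

    m is the number of measurements and n the number of variables, giving
    the ratio R^2 / (1 - R^2) scaled by (m - n) / (n - 1).
    """
    if not 0.0 <= r_squared < 1.0:
        raise ValueError("r_squared must lie in [0, 1)")
    return (r_squared / (1.0 - r_squared)) * (m - n) / (n - 1)


# Hypothetical case: R^2 = 0.5 with 30 measurements and 5 variables.
example = f_ratio(0.5, m=30, n=5)
print(example)
```

An equation whose ratio falls below the critical value at the chosen confidence level would be set to zero, in the manner described for the rejection step of the main text.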

The squared multiple correlation coefficients, interpreted as the amount of variance of
$Y_{iM}$ explained by $\beta_{iN} X_{NM}$, can be transformed into the coordinate system of $F_{NM}$ so
that the amount of variance of the F variables explained by the D variables can be estimated.
The transformation is simply the diagonal terms of

$$\mathrm{diag}\left\{ G_{NN}\, \Omega_{NN}\, (\delta_{NN} R^{2}_{NN})\, G^{T}_{NN} \right\} .$$

However, it is not clear just how useful this information is. What is useful to the user is a
regression transformation from D to F and an expression for the confidence intervals on the
estimates $F_{NM}$.


The transfer function for computing F is simply

$$F_{NM} = \left( G_{NN}\, \beta_{NN}\, E^{T}_{NN} \right) D_{NM}$$

where the expression $G_{NN} \beta_{NN} E^{T}_{NN}$ is identified as the transfer function

$$T_{NN} = G_{NN}\, \beta_{NN}\, E^{T}_{NN} .$$

The calculation of the confidence intervals proceeds as follows and will illuminate some very
useful consequences of the independence and orthogonality properties of the principal
components.

The variance/covariance of the regression equations can be computed by the following
procedure:


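$$V(\beta_{ij}) = E\left\{ (\beta_i - \bar{\beta}_i)(\beta_j - \bar{\beta}_j)^{T} \right\}$$

$$V(\beta_{ij}) = E\left\{ (Y_{i\alpha} - \hat{Y}_{i\alpha})(Y_{j\alpha} - \hat{Y}_{j\alpha})^{T}\, X_{NM} X^{T}_{MN}\, \Lambda^{*-1}_{NN} \Lambda^{*-1}_{NN} \right\}$$

$$V(\beta_{ij}) = E\left\{ (Y_{i\alpha} - \hat{Y}_{i\alpha})(Y_{j\alpha} - \hat{Y}_{j\alpha})^{T} \right\} \Lambda^{*-1}_{NN}$$

$$V(\beta_{ij}) = \sigma_{ij} \cdot \Lambda^{*-1}_{NN}$$

For i, j = 1, N this becomes

$$V(\beta_{NN}) = \tilde{\Sigma}_{NN} \otimes \Lambda^{*-1}_{NN}$$

where $\tilde{\Sigma}_{NN}$ is the unbiased estimate of $\Sigma_{NN}$.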

Recalling the previous argument regarding the off-diagonal elements of $\Sigma_{NN}$, we may
ignore them, in which case $V(\beta_{NN})$ is a diagonal matrix of dimension $N^{2} \times N^{2}$. It is worth
noting that, in general, the off-diagonal elements are usually at least one or more orders of
magnitude down from the diagonal terms $\beta_{iN} \Lambda^{*}_{NN} \beta^{T}_{Ni}$.

The matrix $V(\beta)$ can be partitioned as:

$$V(\beta) = \begin{pmatrix} V^{1}_{NN} & & \\ & \ddots & \\ & & V^{N}_{NN} \end{pmatrix}$$

where the submatrix $V^{K}_{NN}$ is a diagonal matrix for the K-th row of $\beta_{NN}$, $\beta_{KN}$. The
submatrices are found by

$$V^{K}_{NN} = \sigma_{KK}\, \Lambda^{*-1}_{NN}$$

where $\sigma_{KK}$ is the K-th diagonal element of $\tilde{\Sigma}_{NN}$ and $\lambda^{*-1}_{nn}$ are the diagonal
elements of $\Lambda^{*-1}_{NN}$. Thus, it is clear that

$$\mathrm{cov}(\beta_{Kl}, \beta_{Km}) = 0, \quad l \neq m$$

as a consequence of the orthogonality of $X_{NM}$. This implies that the joint confidence
regions of each of the regression equations are entirely unambiguous. This is vital to the
interpretation of the confidence intervals of the estimates, $F_{NM}$, as legitimate intervals. We
can also see how the variance of the $\beta$ increases for the less important components of X for
any given component in Y.

The independence of the $\beta_{iN}$ for any given i is of great help in the application of the
regression analysis. This comes about when one recalls how the principal component
transformation, in addition to its properties of independence and orthogonality, also
possesses the property of having partitioned the variance of D and F into modes of variance
which form a decreasing series of relative contribution to the total variance of the original
data, D and F. Of concern here is the set D leading to X. If the set $X_{NM}$ is so large in the
dimension N as to be undesirable, one can select those components which contain a
prescribed amount of variance less than 100 percent. Thus, $X_{NM}$ is replaced by $X_{pM}$; p < N.
The assumption that the principal components have been sorted has been made.

This  selection  of  p components  will  cause  the  regression  equation0NNto  be0Np.What  is 
important  here  is  that  the  remaining  P coefficients  are  unaffected  by  the  rejection  of  the  last 
N—  p coefficients.  Of  course,  the  R2NN  is  lowered,  but  then  the  F — ratio  may  be  increased 
because  of  the  change  in  the  degrees  of  freedom  involved.  The  price  paid  for  this  reduction 
in  the  number  of  variables  in  X is  that  one  may  not  know  a priori  which  modes  of  variance 
in  X will  correlate  with  any  one  of  the  modes  of  variance  in  YNM  . The  decision  must  be 
made  on  the  results  of  seeing  all,  or  at  least  those  allowed  by  computer  limits  tried  first.  For 
this  reason,  an  interaction  of  the  user  is  required  in  the  use  of  this  analysis.  Also  the  F — test 
(as  will  be  seen  later)  can  be  used  in  conjuetion  with  the  selection  of  p to  improve  the 
confidence  interval  calculation.  The  set  YNM  may  also  be  reduced  leading  to  0qp  ,q<  n , q 
a p.  However,  this  would  preempt  the  F — test  and  therefore  shor'd  be  used  only  to  satisfy 
the  computer  limits. 
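
The claim that dropping the last N - p principal components leaves the remaining coefficients unchanged can be checked numerically. The following is a minimal sketch, assuming synthetic random data and NumPy; the helper names `principal_components` and `regress` are illustrative and are not taken from the CORMAT or REGRESS packages. For brevity the regressand is only centered, not itself transformed to principal components, which does not affect the property being shown.

```python
import numpy as np

rng = np.random.default_rng(0)
N, M = 5, 200
D = rng.normal(size=(N, M))   # synthetic regressor data, variables x observations
F = rng.normal(size=(3, M))   # synthetic regressand data

def principal_components(data):
    """Return principal components (as rows) sorted by decreasing variance."""
    centered = data - data.mean(axis=1, keepdims=True)
    cov = centered @ centered.T / (centered.shape[1] - 1)
    eigvals, eigvecs = np.linalg.eigh(cov)      # eigh returns ascending order
    order = np.argsort(eigvals)[::-1]
    return eigvecs[:, order].T @ centered, eigvals[order]

def regress(Y, X):
    """Least-squares coefficients beta such that Y is approximated by beta @ X."""
    return Y @ X.T @ np.linalg.inv(X @ X.T)

X, lam = principal_components(D)
Y = F - F.mean(axis=1, keepdims=True)

beta_full = regress(Y, X)        # all N components
p = 3
beta_trunc = regress(Y, X[:p])   # only the p leading components

# Because the rows of X are mutually orthogonal, X @ X.T is diagonal and
# each coefficient is estimated independently of the others.
print(np.allclose(beta_full[:, :p], beta_trunc))
```

With correlated (non-orthogonal) regressors the same truncation would in general change every remaining coefficient, which is the practical advantage the text attributes to the principal component transformation.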
This confidence interval calculation proceeds as follows. We start with the confidence
interval calculation of $\beta_{NN}$:

$\mathrm{Conf}(1-\alpha, \beta_{NN}) = t(M-N-1, 1-\alpha/2) \cdot \sqrt{V(\beta)}$

For an individual row of the regression matrix this becomes


$\mathrm{Conf}(1-\alpha, \beta_{KN}) = t(M-N-1, 1-\alpha/2) \cdot \sqrt{U^{K}_{NN}}$

This can be expanded to compute the confidence interval of the results if an operation
indicated by $\delta_{1M}$ is performed to convert the M x M matrix computed by

$\mathrm{Conf}(1-\alpha, Y_{KM}) = t(M-N-1, 1-\alpha/2) \cdot \sqrt{X'_{MN} U^{K}_{NN} X_{NM}}$

into a 1 x M matrix corresponding to the $K$th row of $Y_{NM}$.

That is

$\mathrm{Conf}(1-\alpha, Y_{K,1M}) = t(M-N-1, 1-\alpha/2) \cdot \sqrt{\delta_{1M}[X'_{MN} U^{K}_{NN} X_{NM}]}$
which can be re-written as

$\delta Y_{K,1M} = t(M-N-1, 1-\alpha/2) \cdot \sqrt{\delta_{1M}[D'_{MN} E'_{NN} U^{K}_{NN} E_{NN} D_{NM}]}$

If this operation is done for K = 1, ..., N, $\delta Y$ becomes a matrix $\delta Y_{NM}$ of confidence
intervals of $Y_{NM}$. Note that $\delta Y$ decreases as the rank of $U_{NN}$ decreases. This matrix of
intervals can then be transformed back into F-space by $G_{NN}$. Thus we get the $1-\alpha$
confidence intervals of $\hat F_{NM}$ by

$\delta F_{NM} = G_{NN} \cdot \delta Y_{NM}$

If the calculation of $\hat F$ is performed using independent data, $D^*_{Nq}$, as would be
applied to $T_{NN}$, the calculation would appear as

$\hat F^*_{Nq} = T_{NN} D^*_{Nq}$


by  5 


and 


Note that E and G are from the calibration data, D and F, used to compute T and
$V(\hat\beta_{NN})$. The condition on $D^*_{Nq}$ is that it comes from the same distribution as did $D_{NM}$.
Note that if E and G come from the correlation matrix, then the estimates $\hat F \pm \delta F$ are in
units of standard deviations.

A further refinement in the accuracy of the regression (over that of eliminating
unnecessary components in $X_{NM}$) can be introduced by using the F-test to reject
(suppress to zero) entire regression equations. This has the effect of setting to zero
components of $Y_{NM}$ which have insufficient probability of being more meaningful than
zero. This amounts to a kind of stepwise regression, except that the elimination of some of
the components estimated in Y leaves the remaining components unaffected, since they
are independent.

The application of the F-test rejection involves the calculation of $R^2_{Np}$; $p < N$
according to the amount of variance desired by the user based on experience. From the
$R^2_{Np}$, the F-ratio is calculated. Those F-ratios failing the minimum value (95%
confidence level) cause the corresponding rows of $\beta_{NN}$ and submatrices $U^{K}_{NN}$ to be set to
zero, and the calculation of $T_{NN}$, $\hat F_{NN}$, and $V(\hat F_{NN})$ is repeated. The user can then
manipulate p = p(% var. F) until the confidence intervals of $\hat F_{NN}$ appear to be optimum. It
should be noted that in most cases the values of t(M-N-1, 1-$\alpha$/2) do not change too much
for changes in N to p amounting to a few integers, if M is several times as large as N. The
value of the F-ratio, $F_{M-p,\,p-1}$ (7), can be estimated from a simple polynomial in (M -
p) with sufficient accuracy for use here.
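
As an illustration of how an F-ratio follows from a multiple correlation coefficient, here is a minimal sketch using the common textbook form of the overall F statistic for a regression with an intercept. This form, and its degrees of freedom (p and M - p - 1), are an assumption for illustration; the paper's own degrees-of-freedom convention may differ, and the function name `f_ratio` is hypothetical.

```python
def f_ratio(r_squared, n_obs, n_regressors):
    """Textbook overall F statistic computed from R^2.

    Assumes n_regressors predictors plus an intercept fitted to n_obs
    observations, giving (n_regressors, n_obs - n_regressors - 1)
    degrees of freedom.
    """
    if not 0.0 <= r_squared < 1.0:
        raise ValueError("r_squared must lie in [0, 1)")
    df_model = n_regressors
    df_resid = n_obs - n_regressors - 1
    if df_resid <= 0:
        raise ValueError("not enough observations for this many regressors")
    return (r_squared / df_model) / ((1.0 - r_squared) / df_resid)

# Same R^2, fewer regressors: the F-ratio rises as components are dropped.
print(f_ratio(0.5, 36, 10), f_ratio(0.5, 36, 5))
```

The comparison shows why discarding components can raise the F-ratio even when the multiple correlation coefficient itself does not improve.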

The Var(F) can be estimated as mentioned earlier, compounded by the amount of variance
corresponding to the number of principal components Y passing the F-test rejection.

It is important to realize the physical implication of the means by which the accuracy of
the regression is improved. By the initial assumptions about the data $D_{NM}$ and $F_{NM}$, we
claim normality and a representative sample of the behavior of the observed phenomena for
all time. Further, we postulate a modal nature of the behavior or variance of the system as
described by the N variables. The variance is further postulated to be
multimodal, with modes numbering up to N and, in general, being of differing relative
magnitude, which add linearly to comprise the total variance of the system.




With these observations in mind, we can now understand what is happening in the
regression situation. When one or more of the least important principal components of
$X_{NM}$ are omitted, we are claiming that those modes of variance of D have an insignificant
statistical relationship to any of the modes of variance of F. When we reject any of the
regression equations by the F-test rejection, we are claiming that the mode of variance of F
represented by that regression equation has an insignificant statistical relationship to all of
the modes of variance of X used in the equation. It is important to reaffirm that one cannot
say a priori which components of X will correlate highly with which components of Y. This
will be clearer upon inspection of the demonstrated regression following. In any case, we
can now understand that we are using as many as possible of the modes of variance of X
that seem to have some significant statistical relationship to at least one of the modes of Y
which passes the F-test. Further, we are allowing only those modes of variance of Y which
have a significant probability of not having been estimated by chance to be
used to reconstruct the regressand, F. In this way, we can see that
the modes of variance of X and Y that have a significant statistical relationship may or
may not be the dominant modes in each, and in any case the regression based estimate of F is
a composite of significant modes estimated in Y without the interference of the insignificant
ones. It may be possible to further improve the estimate by selectively eliminating the
components of X for each regression equation in which the associated regression coefficient
is insignificant. However, this would cause the degrees of freedom for each estimate $\hat Y_{jM}$
to be, in general, different than for the other estimates. This would cause a rather
cumbersome complication in the software, and it is not clear just how beneficial it would be,
since the primary impact is on the confidence interval and not the estimate. Perhaps further
work on the problem may answer these questions.

APPLICATION: 


The regression analysis described in this paper has been implemented in two matching
software packages: CORMAT and REGRESS. Attendant to these packages are two
subroutines: CLEAR, which simply zeroes out an array, and a CDC library subroutine,
MATRIX, which performs matrix operations. The programs CORMAT and REGRESS are
written so as to be used as subroutines themselves in a parent program which reads and
formats the data $D_{NM}$ and $F_{NM}$. D is selected and formatted by another subroutine:
SIGNAL.

The program CORMAT computes the correlation matrix or the covariance matrix, depending on
how it is called, and also the eigenvalues, eigenvectors and principal components. The
number of components computed is determined by the amount of variance requested to be
accounted for. The maximum number of components is limited by the length of data and
the size limitations of the machine. It is worth noting that the program CORMAT will
compute the covariance matrix about a mean value given to it, which may be other than the
mean value of the data supplied. In this way, one may investigate the modes of variance
about a mean value from another distribution.
The output of CORMAT is the correlation or covariance matrix, the list of eigenvalues, the
set of eigenvectors and the set of principal components computed accounting for the
prescribed amount of variance of the input data. CORMAT writes the principal components,
the reduced (if variance accounted for < 100%) set of eigenvectors and the list of eigenvalues
used on a random access file and then returns. The number of eigenvectors used to compute
the principal components is transferred between subroutines.

Subroutine REGRESS uses the principal component sets, the eigenvector sets and the
independent regressor data (referred to as signal data) to compute the regression
coefficients, the transfer function, the multiple correlation coefficients, the F-ratio and the
regression based estimates of the regressand from the signal data. Subroutine REGRESS also
performs the F-test rejection computation.

For an example application, the situation is the regression of tree ring data, the regressor,
taken from the Fraser River Basin against matching precipitation data, the regressand,
occurring during the summer months at Kamloops Meteorological Station, Kamloops,
British Columbia, Canada. The calibration data run for 49 years from 1896 to 1944. The two data
sets are lagged by 14 years. That is, the first column contains years 1 (referred to 1896)
through 14, the second column years 2 through 15, and so on to column 36 containing years
36 to 49. The signal data, $D^*_{Nq}$, is composed of columns selected from $D_{NM}$, the tree
ring data dating from 1500 to 1944, such that the tree ring indices in row 1 correspond to
years in which a large volcano erupted in the region prescribed by the limits of longitude 0° to
135°W, latitude 20°N to 90°N. In appendix A are copies of the printout of the program
with the conditions on percent of variance accounted for and F-test as described in the
printout. Two other calculations were performed requesting 100% and 80% of the variance in
$D_{NS}$. The effect can be seen in Figure 2, where the plots of $\hat F_{NS} \pm \delta F_{NS}$ for the
estimated precipitation in years 1783 to 1796 are shown as an example. The curves are: (a)
100% variance with no F-test, (b) 100% variance with F-test, (c) 95% variance with
F-test, (d) 90% variance with F-test and (e) 80% variance with F-test.
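
The lagged arrangement of the calibration data described above (49 years, 14-year windows, 36 columns) can be sketched as below. The function name `lagged_matrix` is illustrative, and calendar years stand in for the actual tree ring or precipitation values so that the layout is easy to read.

```python
import numpy as np

def lagged_matrix(series, window):
    """Stack overlapping windows of `series` as columns.

    Column j holds series[j : j + window], so a series of length n
    yields n - window + 1 columns of `window` rows each.
    """
    series = np.asarray(series)
    n_cols = len(series) - window + 1
    if n_cols < 1:
        raise ValueError("window is longer than the series")
    return np.column_stack([series[j:j + window] for j in range(n_cols)])

years = np.arange(1896, 1945)        # the 49 calibration years
lagged = lagged_matrix(years, 14)    # 14 rows by 36 columns
print(lagged.shape)
print(lagged[:, 0])                  # years 1 through 14
print(lagged[:, -1])                 # years 36 through 49
```

Each column is one 14-year window, so row 1 of a column selected as signal data is the eruption year and the following rows are the 13 years after it.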

Upon  inspection  of  Figure  2 we  can  see  several  effects  at  work,  all  of  which  involve  the 
user  as  a student  of  the  phenomena  being  analyzed  rather  than  as  a purely  detached 
statistician. 


First, one sees the changing nature of the estimate $\hat F_{NS}$ as fewer modes, or principal
components, of X are allowed in the regression. Secondly, one notices that the 90%
confidence bounds, $\delta F$, of F vary from one element to another within each of the rows of F
for each case (variance accounted for in X). This is to be expected when one recalls the
modal nature of the decomposition of the data into principal components. For any given
mode of variance some of the variables may be emphasized and others may not. This is
evident upon inspection of the associated eigenvectors. This is equivalent to identifying
which variables are contributing significant amounts of variance to a particular mode and
which are simply supplying noise.

On the other hand, if the noise is evenly distributed among the variables and if the entire
mode is essentially a noise mode with none of the variables containing any significant
amounts of signal, then none of the elements of the associated eigenvector will be
prominent. If the noise is not evenly distributed, some of the elements may be prominent in
a noise mode. However, remember that noise is random and unlikely to correlate with
another set of data. Thus the coefficient $\beta$ will be small and the variance $V^{K}_{NN}(\beta)$ will be
large.

When one remembers that the modes are themselves partitioned with respect to the
variance of the original data, it is easy to see how a variable contributing mostly noise in a
dominant mode (dominated itself by signal) may still overpower the contribution of that
same variable contributing mostly signal in a lesser mode.

Another fact which must be considered generally when inspecting the estimates F is whether
or not the noise evidenced by $\delta F$ is caused by uncertainty in $\beta$ or by the physical
phenomenon itself. This problem is largely self-correcting, the two being one and the same, when one
assumes that the noise should be highly uncorrelated between the sets D and F and also
recalls that the $\beta$ are independent within each regression equation. Thus, the regression
coefficients should be essentially zero for noise, and this in turn will cause $V^{K}_{NN}$ to be
large. Therefore, by disregarding an estimate in one case (variance accounted for) because of
a large $\delta F$, one is always sure of not overlooking a valid signal, and by the same argument,
keeping an estimate $\hat F_{ij}$ from one case because of a small $\delta F$ and plotting it with another
similarly good estimate $\hat F_{jK}$ from a different case, each with its original $\delta F$, one is
simply combining good estimates of F from D and disregarding noise. In a sense, one is simply
keeping those components of X and Y which contain mostly signal and discarding those
which contain mostly noise.

Using  these  arguments,  the  final  regression  based  estimate  of  pre-summer  precipitation  in 
Kamloops,  Canada,  for  14  years  after  the  Icelandic  volcano  eruption  in  1783  appears  as 
shown  in  figure  3.  The  units  are  standard  units  of  deviation  about  the  mean  and  the  error 
bars  are  95%  confidence  bounds. 
Figure 3


Before leaving the topic of inspection of results, note the transfer function itself (plates 6,
7 and 10). The reader will note the occurrence of "ridges" and "valleys" running diagonally
from rows 1 and 6 and column 0.

Inspections such as this of the transfer function, and also of the eigenvectors, can reveal
the likelihood of physical relationships between and within the sets F and D worthy of
future causal investigations.
PREDICTING METASTASIS OF ENUCLEATED SMALL
OPHTHALMIC MELANOMAS BY DISCRIMINANT FUNCTION

Walter D. Foster
Ian W. McLean (Maj, MC, USA)
Armed Forces Institute of Pathology
Washington, D. C. 20306

ABSTRACT. Malignant melanomas of the uveal tract of
the eye are tumors with a significant risk of metastasis.
To reduce this risk, it has been the practice of ophthalmologists,
after making a clinical diagnosis of malignant melanoma,
to recommend immediate enucleation of the eye. The objective
of this study was to search for a criterion by which the risk
of metastasis could be estimated.

Over 300 cases of enucleation for small malignant melanomas
have been referred to the Ophthalmic Pathology Division
of the Armed Forces Institute of Pathology (AFIP) for investigation
and research. Of these cases, pathologists recorded
16 characteristics on each of 72 eyes, together with information
on whether the tumor had metastasized. Analysis by
stepwise discriminant function was employed to suggest which
of these characteristics might be predictive of metastasis
and the degree of their effectiveness. An unexpected dividend
in the use of the discriminant function was the redefinition
of some of the characteristics and the review of the original
data for others, in a medico-statistical dialog in the refinement
of the capability for prediction. The following table
shows the degree of success of the analysis for the body of
data at hand:

Table 1. Comparison of Classification by Discriminant
Function with Actual Behavior in 72 Cases of Small
Ophthalmic Melanoma

Actual group             Correct       Incorrect
                         Prediction    Prediction

Nonmetastasizing (40)    34            6
Metastasizing (32)       27            5


1. INTRODUCTION. A major ophthalmic problem is concerned
with the decision whether to advise enucleation of
the eye when a small intraocular melanoma has been discovered.
The decision to remove the eye depends heavily
upon the risk of metastasis. In the case of small tumors
of the choroid, the surgeon is faced with the difficult
decision of whether to remove the eye or continue to observe
the lesion until there is greater certainty that it is
malignant. The purpose of this research was to discover
whether there is a basis for estimating the risk of metastasis.

The Ophthalmic Pathology Division of AFIP has the largest
known collection of eyes enucleated for small malignant melanomas,
over 300 cases. After defining 15 characteristics as
possible predictors of metastasis, pathologists selected 76
tumor-containing eyes enucleated prior to 1945 because all
needed data were available; 34 tumors were known to have resulted
in metastasis, and 42 patients were free of metastasis
at the last known status 7 or more years after enucleation.

Fisher's linear discriminant function was chosen as the
statistical function for the classification of these melanomas
on the basis of the 15 predictors. Analysis by stepwise
discriminant function to order the predictors in terms of
their relative predictability was envisioned as a process
for identifying the most meaningful set of predictors to be
compared with the list of predictors selected by pathologists
from medical experience for intraocular melanomas of all
sizes.

2. DISCRIMINANT FUNCTION. To define the linear discriminant
function,

Let $X_i$ = i-th characteristic, e.g., size; i = 1, ..., 15,

Let $B_i$ = coefficient of $X_i$ to be estimated.

Set $Z_1 = \sum_i B_i X_i$ for melanomas of one group and $Z_2 = \sum_i B_i X_i$ for
melanomas of the other (metastasizing and nonmetastasizing).

Let $D = \bar Z_1 - \bar Z_2$ and $d_i = \bar X_{i1} - \bar X_{i2}$, so that

$D = \sum_i B_i d_i$ and $V(D) = \sum_i \sum_j B_i B_j s_{ij} = S$.

For analogy with the univariate case, just as we wish to
maximize

$t = (\bar X_1 - \bar X_2) / s(1/n_1 + 1/n_2)^{1/2}$ or its square,

in the discriminant function the $B_i$ are estimated by
maximizing $D^2/S$:

$\partial (D^2/S) / \partial B_i = 0$, whose resulting equations have the solution

$B_i = (S/D) \sum_j d_j s^{ij}$, where $s^{ij}$ is an element of the
inverse of the variance-covariance matrix of the $X_i$, pooled
over the two groups under the assumption of homoscedasticity.
The constant S/D has no meaning as far as the discriminant
function is concerned and can be arbitrarily equated to
unity for simplicity.
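
The solution above, with the constant S/D set to unity, amounts to multiplying the vector of mean differences by the inverse of the pooled covariance matrix. The following is a minimal NumPy sketch under that reading; the function names and the synthetic three-predictor data are assumptions for illustration only (the group sizes 34 and 42 merely echo the case counts quoted in the text, and no real melanoma data are used).

```python
import numpy as np

def fisher_discriminant(group1, group2):
    """Return (B, d, pooled) for two groups given as cases x predictors arrays."""
    g1 = np.asarray(group1, dtype=float)
    g2 = np.asarray(group2, dtype=float)
    n1, n2 = g1.shape[0], g2.shape[0]
    d = g1.mean(axis=0) - g2.mean(axis=0)       # differences of group means
    cov1 = np.atleast_2d(np.cov(g1, rowvar=False))
    cov2 = np.atleast_2d(np.cov(g2, rowvar=False))
    pooled = ((n1 - 1) * cov1 + (n2 - 1) * cov2) / (n1 + n2 - 2)
    B = np.linalg.solve(pooled, d)              # B = pooled^{-1} d, with S/D = 1
    return B, d, pooled

def separation(B, d, pooled):
    """The criterion D^2 / S for a coefficient vector B."""
    return (B @ d) ** 2 / (B @ pooled @ B)

rng = np.random.default_rng(1)
group1 = rng.normal(loc=[1.0, 0.5, 0.0], scale=1.0, size=(34, 3))  # synthetic
group2 = rng.normal(loc=[0.0, 0.0, 0.0], scale=1.0, size=(42, 3))  # synthetic
B, d, pooled = fisher_discriminant(group1, group2)
print(B, separation(B, d, pooled))
```

Rescaling B by any nonzero constant leaves the criterion unchanged, which is why the factor S/D can be dropped without affecting the classification.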

The assumptions for the probability statements implicit
in the use of the discriminant function require that the
predictor variables be continuous and have a joint multivariate
normal distribution and that the variance-covariance
matrices of the two groups be equal. Therefore, the following list
of proposed predictors (characteristics) was examined in
terms of its marginal distribution properties.




Predictor
Number    Predictor                 Univariate properties

 1        Age                       Continuous, approximately normal
 2        Duration                  Continuous, skewed to right
 3        Enucleation date          Continuous, skewed to left
 4        Size                      Continuous, skewed to right
 5        Volume                    Continuous, skewed to right
 6        Area                      Continuous, skewed to right
 7        Sex                       Two-class, uniform
 8        Posterior margin          Nine-class, skewed to right
 9        Anterior margin           Nine-class, skewed to right
10        Eye                       Two-class, uniform
11        Cell type                 Four-class, skewed
12        Pigment                   Four-class, skewed to right
13        Fiber                     Five-class, skewed to right
14        Scleral extension         Four-class, skewed to right
15        Optic nerve extension     Four-class, skewed to right

Three results were immediately obvious. Not only was the
assumption of multivariate normality invalid, but it also was
clear that the covariance matrices of the two groups,
metastasizing and nonmetastasizing, were not equal. Moreover,
the additive model was at best a first approximation.
Nevertheless, it was felt that an imperfect approach could
be tried and judged on its performance. The UCLA BMD program,
stepwise discriminant function, was utilized. This
program selects as the first predictor, and as successive
predictors in turn, that one for which the likelihood ratio
expressed in terms of the F-statistic is a maximum. The
advantage of this approach is obvious: the set of predictors
is ordered and can be truncated at any point by the
experimenter. Further, in this study it offered a comparison
of predictors selected in this fashion to those
previously selected by the pathologists from experience.


3. MEDICO-STATISTICAL INTERACTIVE DIALOG. The results
of our first run with the stepwise discriminant-function
program are given in Table 2. It was clear from this run
that, of the 15 original predictors, no more than 8 were
effective in combination. An astounding finding was the
failure of predictor #11, cell type, which was the leading
choice by pathologists' experience, to be included in the
list of effective predictors. Also unexpected were the
negative signs for the coefficients for predictor #2,
duration; #1, age; #3, enucleation date; and #6, area.
In effect, the data were contradicting the notion that the
probability for metastasis increases with increasing age,
duration of the melanoma, area of the melanoma, etc. The
error rate for false positives, F+, defined as nonmetastasizing
cases erroneously classified by the function as
metastatic, was 5/42, or .12, while that for false negatives,
F-, defined as metastatic cases erroneously classified
as nonmetastatic, was 11/34, or .32, for a total error
of 16/76, or .21. Our review of these results included a
detailed examination of those cases that were misclassified
by the discriminant function. This review revealed
inconsistent criteria for cell type and three cases that
should not have been included in the study.
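
The error rates quoted for the first run follow directly from the misclassification counts. A small sketch of the arithmetic, with a hypothetical helper name `error_rates`:

```python
def error_rates(false_pos, n_nonmetastasizing, false_neg, n_metastasizing):
    """Return the F+, F- and total error rates as fractions."""
    total_cases = n_nonmetastasizing + n_metastasizing
    return {
        "F+": false_pos / n_nonmetastasizing,    # nonmetastasizing called metastatic
        "F-": false_neg / n_metastasizing,       # metastatic called nonmetastatic
        "total": (false_pos + false_neg) / total_cases,
    }

run1 = error_rates(false_pos=5, n_nonmetastasizing=42,
                   false_neg=11, n_metastasizing=34)
print({name: round(rate, 2) for name, rate in run1.items()})
```

Note that the two rates use different denominators, so the total error rate is a case-weighted average of F+ and F- rather than their simple mean.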

The second run, with the value for cell type revised
by the consensus of three pathologists, selected the predictors
in the order shown in Table 2. The review of run
#2 found yet another case erroneously included in the original
set of data. It was of interest that the refined
definition of cell type, predictor #11, was included in the
group of meaningful predictors. Only fiber content as
a predictor in the pathologists' list failed to be included
in the group of meaningful predictors in run #2, although
it was noted that age continued to have a negative coefficient.
At this point, it was decided to add a 16th predictor,
mitotic activity, for the next run.

Run #3, shown in Table 2, did include the new predictor,
mitotic activity, but unexpectedly dropped cell
type. The total error rate stayed about the same as before
despite a slight shift in the F+ and F- rates. Pathologists'
opinion did not agree that #1 (age), #2 (duration), and
#3 (enucleation date) were physically meaningful and
recommended that these as well as #15 (optic nerve extension)
be dropped as predictors for the next run.

Run #4 did not discriminate as well as runs #2 and #3.
Its overall error rate was 15/72, with F+ as 7/40 and F-
as 8/32. It also dropped both cell type and mitotic activity
as meaningful predictors. It did continue to show an
acceptable level of discrimination.
With these as the results thus far, we reminded ourselves
that it has been the thrust of this preliminary paper not so
much to list medical findings or implications (which will be
reported elsewhere) as to suggest the value of the continuing
medico-statistical interactive dialog in the winnowing process
of finding and redefining meaningful predictors.

4. CONCLUSIONS. The discriminant-function approach appears
to offer considerable promise to serve as a basis for
estimating risk probabilities as a help to medical practice
in evaluating small ophthalmic melanomas. Future investigation
in this specific direction will include (1) the use
of this discriminant function on a new population to estimate
true error rates and to improve overall predictive ability,
(2) the reformulation of the prediction function to allow
greater flexibility than offered by the linear terms, such
as "product" or "reciprocal," or special relationships
among the variables, and (3) possible use of transformations
toward achieving normality.


The opinions or assertions contained herein are the
private views of the authors and are not to be construed
as official or as reflecting the views of the Department
of the Army or the Department of Defense.




FORECASTING MODELS FOR MOSQUITO POPULATION BEHAVIOR


Stephen Smaach and Chris P. Tsokos
Department of Mathematics
University of South Florida
Tampa, Florida 33620


ABSTRACT. The object of this paper is to develop statistical models to
forecast mosquito densities up to a specific desired time in advance.

It is shown that the mosquito series is a non-stationary stochastic
realization. A procedure is presented for modeling the mosquito densities
for the purpose of forecasting one, two, three, ..., k days ahead.
Autoregressive, moving average and mixed autoregressive-moving average models
have been utilized for the purpose of predicting mosquito density behavior.

In addition, the technique of utilizing the formulated models in a
simulation study to determine the influence of several pesticide application
strategies is briefly discussed.

1. INTRODUCTION. Aside from the nuisance factor associated with the
presence of mosquitos in the human environment, it is of interest to develop
control strategies for mosquito populations since they serve as essential
links in the life cycles of a number of human parasites. The incidence of
such parasites can be controlled by reducing the population density of their
mosquito vectors. Control techniques can take the form of pesticide spraying
strategies and alteration of the mosquito larval habitats. The development
of accurate statistical models to predict future mosquito densities can be
used to advantage by scientists studying control of mosquito-related diseases.
Such statistical models could be used to simulate population density behavior
under various control strategies and hence serve as an evaluation of control
strategies, independent of field tests.

In the present investigation, statistical models are formulated to predict
mosquito population densities up to four days in advance. The procedures
used are those developed by C.P. Tsokos [2] for use in formulating forecasting
models from non-stationary time series. The data used in this investigation
consist of three years of light-trap capture data of adult female mosquitos
(Culex tarsalis) collected at two day intervals from light trap stations in
Malvern, Iowa during 1969, 1970 and 1971. Hacker, Scott and Thompson [1]
have analyzed these data using a somewhat different approach. Professor Thompson
discussed their investigation with the present authors and kindly provided the
data for our independent analysis.

We shall be concerned with an important class of statistical models, viz, the
autoregressive process, the moving average process and the mixed autoregressive-
moving average process. These processes have been widely used for describing
stationary time series (i.e., those time series that are in statistical
equilibrium about a constant mean level). However, much biological data is
non-stationary. One can transform non-stationary data in such a manner that
it can be treated as a stationary series. Such transformations consist of
applying an appropriate filter to the observed time series to "filter out"
non-stationary components. In the present investigation, applications of
first or second difference filters remove the non-stationary components of
the data. Once we have obtained a model for the filtered, stationary series,
we must employ the appropriate "backward" filter to replace the non-stationary
components. The result will be a model that can be used to obtain forecast
values of the original non-stationary time series.
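The filtering idea described above can be illustrated with a minimal
sketch. The function names `difference` and `undifference` are hypothetical
and are not taken from the authors' software; the sketch only shows that a
difference filter can be undone exactly once the first few original values
are known.

```python
def difference(series, order=1):
    """Apply the first difference filter y_t = x_t - x_{t-1} `order` times."""
    out = list(series)
    for _ in range(order):
        out = [b - a for a, b in zip(out, out[1:])]
    return out


def undifference(diffs, initial_values):
    """Invert `difference` (a "backward" filter).

    `initial_values` holds the first values of the original series:
    one value for a first difference, two for a second difference.
    """
    # Collect the leading value of each intermediate differenced series.
    heads = []
    level = list(initial_values)
    for _ in range(len(initial_values)):
        heads.append(level[0])
        level = [b - a for a, b in zip(level, level[1:])]
    # Integrate back, innermost difference first.
    out = list(diffs)
    for head in reversed(heads):
        rebuilt = [head]
        for d in out:
            rebuilt.append(rebuilt[-1] + d)
        out = rebuilt
    return out


if __name__ == "__main__":
    x = [1, 4, 9, 16, 25]
    y = difference(x, order=2)
    print(y, undifference(y, x[:2]))
```

A second difference of a quadratic trend is constant, which is the sense in
which the filter removes a non-stationary component.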

In section 2, the autoregressive, moving average and mixed processes are
defined and a procedure for obtaining the "best" statistical model among
them is explained in greater detail. In section 3, this procedure is applied
to a smoothed-data version of the mosquito population density data and
forecasting models are developed. The smoothing procedure is that employed
by Hacker et al. [1]. In section 4, the procedure developed in section 2
is applied to the original, non-smoothed mosquito population data and
forecasting models are developed. Finally, in section 5 we discuss the approach
used in this investigation as compared to the approach used by Hacker et al.
and describe further research being contemplated in this area.

2. PROCEDURE. A discrete m-order autoregressive process derived from a
purely random process is given by

\[ X_t - \mu = a_1 (X_{t-1} - \mu) + a_2 (X_{t-2} - \mu) + \cdots + a_m (X_{t-m} - \mu) + Z_t \tag{2.1} \]

where $X_t$ is the autoregressive series; $a_1, a_2, \ldots, a_m$ are parameters of the
process; and $\mu$ is the expected value of the series $X_t$. Such a process assumes
that the current value of a series can be expressed as a linear sum of past
values plus an independent error term $Z_t$, not connected with the past.

A finite moving average process of order q is given by

\[ X_t - \mu = Z_t - \beta_1 Z_{t-1} - \cdots - \beta_q Z_{t-q} \tag{2.2} \]

where $X_t$ is the moving average series; $\beta_1, \ldots, \beta_q$ are parameters of the
process; and $\mu$ is the expected value of the series. This process is interpreted
as a weighted sum of a random series, $Z_t$.
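As an illustration of definitions (2.1) and (2.2), the following sketch
generates artificial series from both processes. It assumes normally
distributed $Z_t$ with standard deviation `sigma` and starts the
autoregression from zero deviations; the function names and these choices
are assumptions made for the example, not part of the procedure of Tsokos.

```python
import random


def simulate_ar(coeffs, mu, n, sigma=1.0, seed=0):
    """X_t - mu = a_1 (X_{t-1} - mu) + ... + a_m (X_{t-m} - mu) + Z_t."""
    rng = random.Random(seed)
    m = len(coeffs)
    dev = [0.0] * m  # past deviations X - mu, assumed zero before the start
    for _ in range(n):
        z = rng.gauss(0.0, sigma)
        past = dev[-m:][::-1] if m else []  # most recent deviation first
        dev.append(sum(a * d for a, d in zip(coeffs, past)) + z)
    return [mu + d for d in dev[m:]]


def simulate_ma(betas, mu, n, sigma=1.0, seed=0):
    """X_t - mu = Z_t - beta_1 Z_{t-1} - ... - beta_q Z_{t-q}."""
    rng = random.Random(seed)
    q = len(betas)
    z = [rng.gauss(0.0, sigma) for _ in range(n + q)]
    return [
        mu + z[t] - sum(b * z[t - j] for j, b in enumerate(betas, start=1))
        for t in range(q, n + q)
    ]


if __name__ == "__main__":
    print(simulate_ar([0.6, -0.2], mu=5.0, n=5))
    print(simulate_ma([0.4, 0.2], mu=5.0, n=5))
```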

A mixed autoregressive-moving average process of order (m, q) is given by

\[ X_t - \mu = a_1 (X_{t-1} - \mu) + \cdots + a_m (X_{t-m} - \mu) + Z_t - \beta_1 Z_{t-1} - \cdots - \beta_q Z_{t-q} \tag{2.3} \]

where the value of q is independent of the value of m and all other symbols
are as defined above.

The procedure used in the present investigation to determine an
appropriate statistical time series model is that procedure developed by
Tsokos [2] and is summarized below:

(i) Test the original series for stationarity. A trend test such as
Kendall's tau is used to test for stationarity. If the original
series fails this test, a first difference filter is applied to
the original series to create a new series. The testing procedure
is repeated and first differences continue to be applied as necessary
until a time series is obtained that passes the stationarity test.
A second order difference filter is usually sufficient to filter
out non-stationary components.

(ii) Determine the "best" statistical time series model. Using the
time series obtained in step (i) a computerized searching procedure
is initiated to determine the model and its order from among the
models discussed above that best fits the data. The criterion for
selecting the best model for the filtered series is based upon
estimates of residual variances. One proceeds by estimating the
parameters of the different models for different orders. The
residual variance estimates are then computed and recorded against
the orders of the processes. The minimum residual variance will
correspond to the order and type of process which best fits the
filtered series.


(iii) Apply an appropriate backward filter. If the original time series
were non-stationary, then the model chosen under step (ii) was
appropriate for the filtered, stationary series. Hence, at this
step a backward filter is applied to replace the non-stationary
components. For example, if a first difference filter,
$y_t = x_t - x_{t-1}$, had been applied to the original series and the
appropriate model for the filtered series had been of order (1, 1)
then the model has the form

\[ y_t - \hat{\mu} = \hat{a}_1 (y_{t-1} - \hat{\mu}) + Z_t - \hat{\beta}_1 Z_{t-1} \tag{2.4} \]

where $\hat{\mu}$, $\hat{a}_1$ and $\hat{\beta}_1$ are estimates of the parameters based upon the
filtered series (see Tsokos [2]). Written in terms of the $X_t$'s,
equation (2.4) becomes

\[ X_t = (1 - \hat{a}_1)\hat{\mu} + (1 + \hat{a}_1) X_{t-1} - \hat{a}_1 X_{t-2} + Z_t - \hat{\beta}_1 Z_{t-1} \tag{2.5} \]

The process of going from equation (2.4) to equation (2.5) is
called "applying the appropriate backward filter". It is
equation (2.5) that is then used in step (iv) to forecast future
values of the $X_t$ process.

(iv) Forecast values of the original time series $\ell$ days ahead. We
desire to forecast a value $x_{t+\ell}$, $\ell \ge 1$, when we are currently at
time t. For example, as discussed in Tsokos, the generalized
mixed model under the influence of a first difference filter has
minimum variance $\ell$-day ahead forecast given by

\[ \hat{X}_t(\ell) = \phi_0 + \phi_1 X_{t+\ell-1} + \cdots + \phi_{m+1} X_{t+\ell-m-1} - \beta_1 Z_{t+\ell-1} - \cdots - \beta_q Z_{t+\ell-q} \tag{2.6} \]

where $\hat{X}_t(\ell) = E_t[X_{t+\ell}]$, i.e., the expected value, at time t, of
$X_{t+\ell}$. The constants $\phi_i$ are functions of $\mu$ and the $a_i$'s; the $\beta_i$'s
are defined previously; and $Z_t = X_t - \hat{X}_{t-1}(1)$. Due to the
recursive property of the mixed autoregressive-moving average
process, when we forecast with a lead $\ell \ge 2$, our forecast is
dependent upon the previous forecasted value(s).
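Steps (i) and (iv) can be sketched in a few lines. The sketch below is an
illustration under stated assumptions, not the authors' program: the trend
test is the usual normal approximation to Kendall's S statistic without a
tie correction, the 1.96 threshold is an assumed 5 percent two-sided level,
and the forecast routine covers only the order (1, 1) example of equation
(2.5) with future $Z$ values replaced by their expectation, zero. All
function names are hypothetical.

```python
import math


def kendall_trend_z(series):
    """Normal-approximation z score of Kendall's S against time (no ties)."""
    n = len(series)
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            d = series[j] - series[i]
            s += (d > 0) - (d < 0)
    var_s = n * (n - 1) * (2 * n + 5) / 18.0
    return s / math.sqrt(var_s)


def difference_until_stationary(series, z_crit=1.96, max_order=2):
    """Step (i): difference until the trend test is passed."""
    current = list(series)
    order = 0
    while abs(kendall_trend_z(current)) >= z_crit and order < max_order:
        current = [b - a for a, b in zip(current, current[1:])]
        order += 1
    return order, current


def forecast_order_1_1(x_prev, x_last, z_last, mu, a1, b1, lead):
    """Step (iv) for eq. (2.5): forecasts for leads 1..lead.

    Future Z values are set to zero; each lead >= 2 reuses earlier forecasts.
    """
    history = [x_prev, x_last]
    forecasts = []
    for step in range(1, lead + 1):
        z_term = -b1 * z_last if step == 1 else 0.0
        nxt = ((1 - a1) * mu + (1 + a1) * history[-1]
               - a1 * history[-2] + z_term)
        forecasts.append(nxt)
        history.append(nxt)
    return forecasts


if __name__ == "__main__":
    print(difference_until_stationary(list(range(20)))[0])
    print(forecast_order_1_1(0.0, 1.0, 0.0, 0.0, 0.5, 0.0, 4))
```

The recursion in `forecast_order_1_1` makes the dependence on previous
forecasted values explicit: the lead-2 value is computed from the lead-1
value, and so on.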


In  addition  to  the  procedures  discussed  above,  one  could  proceed  to 
compute  confidence  intervals  for  forecasted  values  and  to  employ  updating 
methods  for  use  in  the  model  as  new  time  series  observations  are  obtained. 
These  techniques  are  not  discussed  here  but  are  well  documented  in  the  paper 
by  Tsokos. 


3. TIME SERIES MODELS FOR THE SMOOTHED DATA. Because of the (apparent)
high noise level in the raw light trap data, Hacker et al. [1] smoothed
the data using a cubic-spline-integration method that is described in their
paper. Figures 1 through 3 below are graphs of the original population data
(solid lines) and the smoothed data (dotted lines) for the years 1969, 1970
and 1971 respectively, collected during the months May through October.

Figures  4 through  6 show  the  smoothed  data  (solid  line)  and  the  4-day 
ahead  forecast  values  (dotted  line)  for  each  of  the  years  1969,  1970,  1971. 

As  can  be  seen  by  inspection,  the  agreement  is  very  good  except  for  the  lag 
between  the  two  curves  which  is  characteristic  for  time  series  work.  All 
three  years  data  required  first  difference  filters  for  stationarity  and 
were  best  fitted  by  order  3 moving  average  processes.  The  models  for 
1969,  1970,  1971  are  presented  as  equations  (3.7),  (3.8)  and  (3.9),  respectively. 

\[ X_t = .0083 + (.9) Z_t - (.1) Z_{t-1} - (.5) Z_{t-2} \tag{3.7} \]

\[ X_t = -.0278 + (.9) Z_t + (.2) Z_{t-1} - (.3) Z_{t-2} \tag{3.8} \]

\[ X_t = .0431 + (.9) Z_t + (0.0) Z_{t-1} - (.3) Z_{t-2} \tag{3.9} \]

Fig. 4. Observed vs. 4-day ahead forecast series for 1969
smoothed data.

4. TIME SERIES MODELS FOR RAW DATA. A data-smoothing technique such as that
applied by Hacker et al. is a reasonable approach to use if one can demonstrate
that the sample population density data is more erratic than that expected for
the true density behavior and if one can identify the sources of noise in the
data. However, to the knowledge of the present authors, this has not been done.
Hence, the nagging possibility remains that smoothing techniques may remove
basic, essential components of the data. In this section we avoid the problems
inherent in the use of smoothed values to predict smoothed values by formulating
statistical time series models for the raw data itself. The following graphs
show the results of applications of the procedure discussed in Section 2 to
the three sets of raw data.


In Figure 7, the raw data collected during May through October, 1969 are
shown, along with the one day ahead forecast generated from the moving average
model of order 3:

\[ x_{t+1} = -(.08) - (.83) z_t - (.71) z_{t-1} - (.4\ldots)\,\ldots \tag{4.1} \]

Again, the agreement is quite good except for the characteristic time lag.

Figure 8 is a visual display of the stationarity test of step (i). If a
series is stationary, then its sample autocorrelation function, $r_{xx}(k)$, should
dampen out to zero fairly rapidly, where

\[ r_{xx}(k) = \frac{1}{n-k} \sum_{t=1}^{n-k} (x_t - \bar{x})(x_{t+k} - \bar{x}) \quad \text{for } k = 0, 1, \ldots, n-1. \]

As can be seen, $r_{xx}(k)$ does not dampen out quickly for the original series
(solid line) but upon application of a second difference filter,
$y_t = x_t - 2x_{t-1} + x_{t-2}$, $r_{yy}(k)$ for the filtered series does dampen out quickly
(dotted line).
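The behavior described for Figure 8 can be reproduced on artificial data.
The sketch below uses a quadratic trend plus normal noise as a stand-in for
a non-stationary series (the Malvern data are not reproduced here), and it
divides $r(k)$ by $r(0)$ so that the values are on a correlation scale; that
normalization and all names are assumptions of the example.

```python
import random


def sample_autocovariance(x, k):
    """r(k) = 1/(n-k) * sum_{t=1}^{n-k} (x_t - mean)(x_{t+k} - mean)."""
    n = len(x)
    mean = sum(x) / n
    return sum((x[t] - mean) * (x[t + k] - mean) for t in range(n - k)) / (n - k)


def sample_autocorrelation(x, k):
    """r(k) scaled by r(0) (assumed normalization)."""
    return sample_autocovariance(x, k) / sample_autocovariance(x, 0)


def second_difference(x):
    """y_t = x_t - 2 x_{t-1} + x_{t-2}."""
    return [x[t] - 2 * x[t - 1] + x[t - 2] for t in range(2, len(x))]


rng = random.Random(1)
trend = [0.05 * t * t + rng.gauss(0.0, 1.0) for t in range(200)]
filtered = second_difference(trend)

if __name__ == "__main__":
    for k in (1, 5, 10):
        print(k,
              round(sample_autocorrelation(trend, k), 2),
              round(sample_autocorrelation(filtered, k), 2))
```

For the trending series the values stay close to one over many lags, while
for the second-differenced series they fall toward zero after the first few
lags.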

Figures 9 and 10 show the 1-day ahead forecasts for the raw data collected
during May through October, 1970 and 1971, respectively. For the 1970 data,
a first difference filter was required for stationarity and the forecasts
were generated from the second order moving average process

\[ x_t = -(.014) - (.20) z_{t-1} - (.30) z_{t-2}. \tag{4.2} \]

The 1971 raw data required a second order difference filter for stationarity
and the forecasts were generated from the third order moving average process

\[ x_t = (.056) - (.99) z_{t-1} - (.09) z_{t-2} + (.18) z_{t-3}. \tag{4.3} \]
The graphs for the sample autocorrelation functions are not included here
since they are qualitatively similar to Figure 8.

Observed series vs. 1-day ahead forecast for 1969
raw density data.

5. CONCLUSION. It has been shown in previous sections that the techniques
discussed here can provide adequate forecasting models for non-stationary
time series, even if those series have suspected, but unaccounted-for noise
components. This seems to be an important improvement over the smoothing
approach used by Hacker et al. If the source of the noise is not identified,
then one cannot be sure that smoothing the data will lead to a set of data
that is more representative of the true state of nature. The technique
developed in the present investigation avoids this problem by formulating
time series models using the raw data itself. (It should be pointed out
that the complete procedure discussed in the previous sections is contained
within a software computer package developed by the authors.)
Further  research  along  the  lines  of  the  present  investigation  is  now 
being  contemplated.  Hacker  et.al.  discuss  a method  for  using  their  model 
equations  to  develop  simulation  studies  useful  in  evaluating  various  control 
strategies for mosquito populations. Their method consists of adding the
(previously suppressed) error term, $e_t$, assumed to be normally distributed.

They  then  can  sample  independent  random  normal  variates  with  the  same  variance 
as  that  estimated  from  the  data,  and  use  these  values  to  drive  the  process. 

It  turns  out,  however,  that  using  a random  walk  of  this  type  occasionally 
yields  pseudo-observations  outside  the  range  of  those  observed  in  the  Malvern 
study.  To  remedy  this,  they  employ  a mathematical  condition  that  reflects 
the  process  away  from  the  boundaries  of  negative  values  and  overly-large 
values.  The  present  authors  are  considering  application  of  this  simulation 
approach  to  the  models  developed  here,  which  we  believe  are  more  representative 
models  of  actual  behavior  of  mosquito  population  densities. 
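The simulation idea can be pictured with the following sketch. The exact
reflecting condition used by Hacker et al. is not given in this paper, so
the mirror reflection below, the simple drift-plus-noise recursion, and all
names and parameters are assumptions chosen only to show how a process
driven by normal variates can be kept inside a bounded range.

```python
import random


def reflect(value, lower, upper):
    """Fold `value` back into [lower, upper] by mirror reflection."""
    width = upper - lower
    if width <= 0:
        raise ValueError("upper must exceed lower")
    offset = (value - lower) % (2 * width)
    return lower + (offset if offset <= width else 2 * width - offset)


def simulate_reflected_walk(start, drift, sigma, n, lower, upper, seed=0):
    """Drive x_{t+1} = x_t + drift + e_t with normal e_t, reflected at bounds."""
    rng = random.Random(seed)
    path = [reflect(start, lower, upper)]
    for _ in range(n):
        step = drift + rng.gauss(0.0, sigma)
        path.append(reflect(path[-1] + step, lower, upper))
    return path


if __name__ == "__main__":
    walk = simulate_reflected_walk(5.0, 0.0, 3.0, 10, lower=0.0, upper=10.0)
    print([round(v, 2) for v in walk])
```

In a simulation study the recursion inside `simulate_reflected_walk` would
be replaced by the fitted forecasting model, with `sigma` set to the
residual standard deviation estimated from the data.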


In  addition  to  the  simulation  studies,  the  present  authors  are  initiating  a 
spectral  analysis  approach  to  the  study  of  this  problem.  Such  an  approach  will 
give  a better  understanding  of  the  intricate  details  and  inter-relationships 
between  the  essential  variables  involved  in  the  study  of  the  behavior  of  mosquito 
populations.


CURVE FITTING OF DISCRETE POINTS BY LEGENDRE POLYNOMIALS


Oskar  M.  Essenwanger 
Physical  Sciences  Directorate 
US  Army  Missile  Command 
Redstone Arsenal, Alabama 35809


ABSTRACT. It is well known that the Legendre polynomials render
the least square fit, while Tchebycheff polynomials provide the minimum
of the maximum deviation from the observed points. Therefore, it should
be assumed that the selection of the desired type would depend only on
the primary goal of the analysis.

Legendre polynomials are in widespread use in mathematics, but
their application to statistical problems is rarely found. This
cannot always be attributed to the differences in goals between
statistical and mathematical analysis. One of the reasons may be the
difficulty of adjustment of Legendre polynomials to discrete point curve
fitting as necessary in statistical analysis. While the Tchebycheff series
is orthogonal for discrete points, the orthogonality of the Legendre series
is based on the continuous type and does not hold up for a small number of
discrete points.

The author has attempted to display first the fitting of discrete
points by Legendre polynomials and compare the results with the Tchebycheff
series. Furthermore, examples are given for the calculation of the
coefficients of the Legendre series from discrete points, and their
relationship with the left variance. Finally, the most advantageous
utilization of Legendre polynomials in statistical analysis is a fitting
to N > 50 where the Tchebycheff series becomes difficult to handle.

1. INTRODUCTION. Although certain types of polynomials such as
the Legendre polynomials are in widespread use for curve fitting in
mathematics, their application in statistical analysis can rarely be
found. It is a well known fact that individual polynomial types serve a
special purpose and have particular properties. Among orthogonal
polynomials the Legendre polynomials render the least square fit while the
Tchebycheff polynomials provide a solution where the maximum deviation
is a minimum.

It may be speculated that this latter property is a preferred
goal in statistical analysis, and therefore the Tchebycheff series
is mostly utilized. This fact is contradicted, however, by various
articles whose authors have employed empirical polynomials. Thus
one would think least square solutions should be found among the
desired analysis goals, and one could discover Legendre polynomial
fits in statistical analysis.

As it will become clear from the subsequent discussion, one
possible reason for the absence of Legendre polynomial fits may be
the difficulty of adjusting Legendre polynomials to discrete point
fitting. Many data in statistical analysis are given or prepared in
the form of discrete points rather than the continuous type of solution
which are usually illustrated in mathematical texts, although a limited
number of discontinuities in the observations (step functions) have
been accepted in numerical analysis.

The fitting of Legendre polynomials to discrete points has,
therefore, been studied in detail in the subsequent sections. As
we shall learn the major problem is not the preparation of Legendre
polynomials for discrete point fitting. The difficulty lies in the
determination of the proper coefficients for the series from discrete
points. Although the Legendre series is orthogonal in a continuum,
the series loses its orthogonality for a small number of discrete
points.

As will be outlined, coefficients from integrals can be calculated
by numerical methods, but disadvantages still remain with respect
to the left variance. The Tchebycheff and the Legendre series are
fitted to wind profile data and the results are comparable. It will
be learned, however, that the Legendre series would be most
advantageously used for the number of points greater than 30, even better for
more than 50 points where no table values for the Tchebycheff series
are readily available, and the orthogonality of the Legendre series is
restored. It should be added that orthogonalized sets of discrete
Legendre polynomials for few points assume the same numerical values
as found for the Tchebycheff series.


2. THE LEGENDRE SERIES. As can readily be found in various
texts (e.g. Boas, 1966; Abramowitz and Stegun, 1964; Essenwanger, 1975a,
etc.) the Legendre polynomials comprise an orthogonal system over the
interval $-1 \le x \le 1$. For details of their analytical expressions the
reader is referred to the literature. Let us denote here the Legendre
polynomial by $P_n(x)$, where n represents the order.


The Legendre polynomials are orthogonal, i.e.

\[ \int_{-1}^{+1} P_h(x) P_k(x)\,dx = \begin{cases} 2/(2n+1) & \text{for } h = k = n \\ 0 & \text{for } h \neq k. \end{cases} \tag{1} \]

Any function Y(z) would be represented by Legendre polynomials with
the transformation y(x) = Y(z). Then

\[ y(x) = \sum_{n=0}^{\infty} a_n P_n(x). \tag{2} \]

The coefficients must be determined from

\[ a_n = [(2n+1)/2] \int_{-1}^{+1} y(x) P_n(x)\,dx, \tag{3} \]

and here begins the difficulty in practical work with discrete points.

If Y(z) is a function which can be expressed in analytical terms, and
the integral can be solved explicitly, the representation of any function
by Legendre polynomials is trivial. Such examples can be found in almost
any text on mathematics or numerical analysis where polynomials are
covered. In the atmospheric sciences or other branches with statistical
analysis we are mostly interested, however, in expressing a discrete
function Y(z) by polynomials. While the coefficients for the
Tchebycheff series are simple to calculate even in this case, the usual
procedure of replacing the integral by the summation sign is insufficient
for a small number of points, i.e. we cannot merely state

\[ a_n = [(2n+1)/2] \sum_i y(x_i) P_n(x_i)\,\Delta x. \tag{4} \]

This replacement would be a permissible approximation for a large
number of points, say probably for about 30 or more and the number
of terms $n \ll 30$. For a small number of points, e.g. seven, this formula
generally does not provide the coefficients $a_n$ accurately enough
to be of value.
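The loss of accuracy for few points can be checked numerically. In the
sketch below the function to be fitted is taken as $y(x) = P_4(x)$, so the
exact coefficient from eqn. (3) is $a_4 = 1$, and the integral is replaced
by a plain sum over class midpoints on $[-1, 1]$. The midpoint grid and the
names used are assumptions of this example.

```python
def legendre(n, x):
    """P_n(x) from the recurrence (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}."""
    if n == 0:
        return 1.0
    p_prev, p = 1.0, x
    for k in range(1, n):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


def coefficient_by_summation(y, n, num_points):
    """a_n ~ (2n+1)/2 * sum_i y(x_i) P_n(x_i) dx over class midpoints."""
    dx = 2.0 / num_points
    xs = [-1.0 + (i + 0.5) * dx for i in range(num_points)]
    return (2 * n + 1) / 2.0 * sum(y(x) * legendre(n, x) for x in xs) * dx


def target(x):
    """Test function y(x) = P_4(x); its exact Legendre coefficient a_4 is 1."""
    return legendre(4, x)


if __name__ == "__main__":
    for num_points in (7, 31, 51):
        print(num_points, round(coefficient_by_summation(target, 4, num_points), 4))
```

With seven points the summed coefficient is far from one, while with
several dozen points it is close, in line with the rule of thumb of about
30 points stated above.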


We may evaluate the success of engaging eqn. (1) by calculating
two polynomial characteristics, the variance $Var_P$ and an integral,
which we may call $S_P$. The two parameters have analytical solutions
depending on n and are defined by

\[ Var_P = \int_{-1}^{+1} P_n^2(x)\,dx = 2/(2n+1), \tag{5} \]

and

\[ S_P = \int_{0}^{1} P_n(x)\,dx = 2^{-n} \sum_{v=0}^{[n/2]} \frac{(-1)^v (2n-2v)!}{v!\,(n-v)!\,(n-2v)!\,(n-2v+1)} = 0 \text{ for even } n \neq 0. \tag{6} \]

Against these expected values the empirical counterparts can be
obtained.

\[ Var'_P = \sum_{i=1}^{N} P_n^2(x_i)\,\Delta x \rightarrow 2/(2n+1). \tag{7} \]


The summation

\[ S'_P = \sum_{x_i=0}^{x_i=1} P_n(x_i)\,\Delta x \tag{8} \]

is somewhat more difficult to calculate due to considerations in the
marginal class intervals. If the two border points $x_i = 0$ and $x_i = 1$
are utilized, the $P_n(x_i=0)$ and $P_n(x_i=1)$ must be multiplied by
$\Delta x/2$. Otherwise, $\sum_{x_i=0}^{x_i=1} \Delta x = 1$ is not fulfilled.

Transformation from the z to the x system is based on the
equalization of the ranges and references, i.e. $x_r \,\hat{=}\, z_r$.
Consequently

\[ x/x_r = (z - z_0)/z_r \tag{9} \]

(the reference equivalent to $z_0$ is $x_0 = 0$).

Since most of the observed discrete variables can be arranged in
steps of class intervals, two versions of the transformation must be
accommodated. Let us assume that 7 points $Y(z_i)$ are given. We
number the variate z from $z_1 = 1$ through $z_7 = 7$ (with unity steps).
If other scales are given, they can be reduced to this basic form (see
later Table 1). The transformation in this case can be written (with
$x_r = 2$ and $z_r = z_7 - z_1 = 6$)

\[ x/2 = (z - 4)/6 \tag{10a} \]

or

\[ 3x + 4 = z. \tag{10b} \]

We shall call this version one.

If we consider $z_1 = 1$ with a lower class boundary of $z_l = 0.5$
and the upper boundary of $z_7$ as $z_u = 7.5$, then $z_r = z_u - z_l = 7$,

\[ x/2 = (z - 4)/7 \tag{11a} \]

or

\[ 3.5x + 4 = z. \tag{11b} \]

This may be called version two. The resulting Legendre polynomials
for these two interpretations are given in Table 1. The respective
Var' and S' parameters are listed in Table 2 for four different
numbers of points.

It is self-evident that the expected $Var_P$ and $S_P$ are best
approximated for the largest subdivision, namely 51 points. The second
version renders a slightly better approximation than the first version.
The deviation increases with ascending polynomial order. In other
words, at least about 30 points are needed to calculate the coefficients
accurately enough by mere summation.

It will be further seen that the discrete Legendre polynomials
for a small number of points are not fully orthogonal (see Table 3).
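The empirical quantities of eqns. (7) and (8) are easy to evaluate. The
sketch below is not a reproduction of Table 2; it assumes that version one
places points on the borders $x = \pm 1$ with $\Delta x = 2/(N-1)$, that
version two uses class midpoints with $\Delta x = 2/N$, and that $S'_P$ is
formed for version one with the half weight $\Delta x/2$ at $x = 0$ and
$x = 1$. Function names are hypothetical.

```python
def legendre(n, x):
    """P_n(x) from the recurrence (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}."""
    if n == 0:
        return 1.0
    p_prev, p = 1.0, x
    for k in range(1, n):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


def grid(num_points, version):
    """Abscissas and class width for the two assumed versions."""
    if version == 1:  # border points x = -1 and x = +1 are included
        dx = 2.0 / (num_points - 1)
        return [-1.0 + i * dx for i in range(num_points)], dx
    dx = 2.0 / num_points  # version 2: class midpoints
    return [-1.0 + (i + 0.5) * dx for i in range(num_points)], dx


def var_prime(n, num_points, version):
    """Eqn. (7): sum of P_n^2(x_i) dx, to be compared with 2/(2n+1)."""
    xs, dx = grid(num_points, version)
    return sum(legendre(n, x) ** 2 for x in xs) * dx


def s_prime(n, num_points):
    """Eqn. (8) for version one (odd num_points), half weights at x=0 and x=1."""
    half = (num_points - 1) // 2  # class intervals between x = 0 and x = 1
    dx = 1.0 / half
    total = 0.0
    for i in range(half + 1):
        weight = 0.5 if i in (0, half) else 1.0
        total += weight * legendre(n, i * dx)
    return total * dx


if __name__ == "__main__":
    for n in range(5):
        print(n, round(2 / (2 * n + 1), 4),
              round(var_prime(n, 7, 1), 4), round(var_prime(n, 7, 2), 4),
              round(var_prime(n, 51, 2), 4))
```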


3. DETERMINATION OF THE CONSTANTS. As can be readily seen from
Table 3, the discrete series of Legendre polynomials for a small number
of points is not fully orthogonal. In an orthogonal system only the
diagonal of the matrix would remain non-zero. Thus the calculation of
coefficients is problematic by replacing an orthogonal system and the
integral by summation. The coefficients of a non-orthogonal system can
be properly calculated as outlined for linear systems (see Essenwanger,
1973a). This is equivalent to converting the "covariance matrix" (left)
into the "coefficient matrix" (right):


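\[
\begin{bmatrix}
\Sigma P_0 P_0 & \Sigma P_0 P_1 & \cdots & \Sigma P_0 P_n & \Sigma P_0 y \\
\Sigma P_1 P_0 & \Sigma P_1 P_1 & \cdots & \Sigma P_1 P_n & \Sigma P_1 y \\
\vdots & \vdots & & \vdots & \vdots \\
\Sigma P_n P_0 & \Sigma P_n P_1 & \cdots & \Sigma P_n P_n & \Sigma P_n y
\end{bmatrix}
\rightarrow
\begin{bmatrix}
1 & 0 & \cdots & 0 & a_0 \\
0 & 1 & \cdots & 0 & a_1 \\
\vdots & \vdots & & \vdots & \vdots \\
0 & 0 & \cdots & 1 & a_n
\end{bmatrix}
\]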

This conversion has been treated in many texts or by the author (1973b,
section 3) and is equivalent with the diagonalization of a matrix.

This technique does not provide "Legendre coefficients" unless the
matrix contains a sufficient number of terms (i.e. orders of $P_n$). E.g.
the following coefficients are obtained for an approximation of y(x)
being a third plus fourth order Tchebycheff polynomial of 7 points (see
Table 8). The last row in each version of Table 4 is identical with
the Legendre coefficients.

Table 3. Covariance matrix for 7-point discrete Legendre
polynomials.

Version 1

         P_0     P_1     P_2     P_3     P_4     P_5

P_0      7.0     0       1.17    0       1.55    0
P_1      0       3.11    0       1.38    0       1.81
P_2      1.17    0       2.53    0       1.66    0
P_3      0       1.38    0       2.47    0       1.89
P_4      1.55    0       1.66    0       2.51    0
P_5      0       1.81    0       1.89    0       2.41
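The sums in Table 3 and the conversion of the covariance matrix into
coefficients can be checked with a short sketch. It assumes the version one
abscissas $x = (z - 4)/3$ for $z = 1, \ldots, 7$ and plain sums
$\Sigma P_h P_k$ without a factor $\Delta x$; the elimination routine is an
ordinary Gauss-Jordan reduction and is not claimed to be the author's
program. All names are hypothetical.

```python
def legendre(n, x):
    """P_n(x) from the recurrence (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}."""
    if n == 0:
        return 1.0
    p_prev, p = 1.0, x
    for k in range(1, n):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


def covariance_matrix(xs, terms):
    """Matrix of sums over the points of P_h(x_i) P_k(x_i), plus the columns."""
    cols = [[legendre(n, x) for x in xs] for n in range(terms)]
    matrix = [[sum(a * b for a, b in zip(ci, cj)) for cj in cols] for ci in cols]
    return matrix, cols


def solve(matrix, rhs):
    """Gauss-Jordan reduction of [matrix | rhs] to [identity | coefficients]."""
    n = len(rhs)
    aug = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    return [row[-1] for row in aug]


def fit(xs, ys, terms):
    """Coefficients a_0 .. a_{terms-1} from the covariance-matrix solution."""
    matrix, cols = covariance_matrix(xs, terms)
    rhs = [sum(p * y for p, y in zip(col, ys)) for col in cols]
    return solve(matrix, rhs)


XS = [(z - 4) / 3.0 for z in range(1, 8)]  # version one, 7 points

if __name__ == "__main__":
    table, _ = covariance_matrix(XS, 6)
    for row in table:
        print(["%.2f" % abs(v) if abs(v) > 1e-12 else "0" for v in row])
    print(fit(XS, [legendre(3, x) for x in XS], 2))
```

Fitting the values of $P_3$ with only the two terms $P_0$ and $P_1$ returns
a non-zero $a_1$, although the Legendre coefficient from eqn. (3) would be
zero. This is the sense in which coefficients of an incomplete
non-orthogonal system differ from the "Legendre coefficients".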

Table  4 . Coefficients  for  3,  4 
Legendre  polynomials. 


Vers icy  id 


p 

0 

>1 

P 

a 

p 

a 

7.0 

0 

-0.07 

0 

0 

2.29 

0 

-0.16 

-0.0? 

0 

1.26 

0 

0 

0.16 

0 

0.75 

-0  '15 

0 

-0.20 

0 

0 

0.34 

0 

-0.34 

and  3 term  of  the  discrete 


P 

*_ 


-0.23 

0 

-o.ae 

o 

0.43 

o 


o 

-o 

o 

-o 

o 

o 


Version  1 


Version  2 


It can be seen from this table that the "Legendre coefficients" (last
row) are not the most advantageous coefficients for an incomplete system,
but the solutions converge with the inclusion of a sufficient number of
(or all possible) terms. Some readers may prefer this method of
calculating coefficients since it is mathematically exact and it certainly
proves advantageous once the number of terms in the series has been
decided upon. As in any non-orthogonal system, the addition of terms
requires a recalculation of coefficients, however.

It is also possible to utilize numerical solution for calculation
of integrals, such as Gregory's or Simpson's rule (see Essenwanger,
1975a, or Abramowitz and Stegun, 1964). The author (1977a) has developed
an iterative process, which in combination with Gregory's or Simpson's
technique, works reasonably well. This combination is necessary since
Gregory's or Simpson's approximation becomes less efficient for the
higher order terms while given correct entries for the lower order terms
the iterative steps lead to good approximations (see Tables 7 and 6).
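A minimal sketch of the numerical-integration route follows. It applies
only the composite Simpson rule to eqn. (3) on equally spaced points that
include $x = \pm 1$; Gregory's formula and the author's iterative
refinement are not reproduced, and the function names are hypothetical.

```python
def legendre(n, x):
    """P_n(x) from the recurrence (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}."""
    if n == 0:
        return 1.0
    p_prev, p = 1.0, x
    for k in range(1, n):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


def simpson(values, dx):
    """Composite Simpson rule; needs an odd number of equally spaced values."""
    if len(values) < 3 or len(values) % 2 == 0:
        raise ValueError("Simpson's rule needs an odd number of points >= 3")
    total = values[0] + values[-1]
    total += 4 * sum(values[1:-1:2]) + 2 * sum(values[2:-1:2])
    return total * dx / 3.0


def legendre_coefficient(ys, n):
    """a_n = (2n+1)/2 * integral of y P_n over [-1, 1], by Simpson's rule."""
    num = len(ys)
    dx = 2.0 / (num - 1)
    xs = [-1.0 + i * dx for i in range(num)]
    integrand = [y * legendre(n, x) for x, y in zip(xs, ys)]
    return (2 * n + 1) / 2.0 * simpson(integrand, dx)


if __name__ == "__main__":
    xs = [-1.0 + i / 3.0 for i in range(7)]
    ys = [x * x for x in xs]  # y = x^2 has a_0 = 1/3 and a_2 = 2/3
    print([round(legendre_coefficient(ys, n), 4) for n in range(3)])
```

For $y = x^2$ on seven points the zero order coefficient is exact, while
the second order coefficient already shows a small error, which
illustrates why the rule becomes less efficient for the higher order terms.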

As evident from Table 4, the coefficients from the covariance
matrix are not identical with the "Legendre coefficients" as they would be
obtained from analytical solutions of the coefficient integrals (eqn. 3).
The two sets converge only after a sufficient number of terms is carried.

In the utilization of numerical methods for calculation of
integrals, however, the "Legendre coefficients" are obtained directly,
if possible, without further modification (see Table 3). The reader
may ask whether it would be desirable to calculate Legendre polynomial
coefficients under these circumstances because they do not provide the
best fit for an insufficient number of terms.

It may be replied that generally curve fitting is of little value
unless at least 80 to 90% of the variance has been explained. In these
cases the coefficients from the solution via the covariance matrix and
numerical methods from integrals merge (see also later the example,
section 6). The question should be rephrased: Do the Legendre
polynomials fulfill any need since the orthogonal system of Tchebycheff
polynomials is available? The answer will be given after some further
discussion.

4. ORTHOGONALIZATION OF DISCRETE LEGENDRE POLYNOMIALS. The
reader may ask whether the discrete Legendre polynomials could be
orthogonalized. Without doubt, orthogonalization is technically
feasible, and the author has produced an orthogonalized set of
polynomials for the 7-point Legendre polynomials which were given
in Table 1. This orthogonalized set is exhibited in Table 7. It
must be reported first that version 1 and version 2 merged to only
one set after this orthogonalization procedure.

A closer perusal of the orthogonalized set reveals that the
columns of Table 7 are now identical with the Tchebycheff 7-point
polynomials except for rounding and a multiplication factor. This has
been found for other numbers of points, too. Identity with the
Tchebycheff system implies, however, that this orthogonalized set
has also assumed the properties of the Tchebycheff polynomials.
Consequently there would be no reason why the Tchebycheff
polynomials could not be employed a priori, since the original purpose
of utilizing the Legendre series is defeated with the change of
properties. Consequently for a small number of points N the
discrete Legendre series would not be very advantageous while its
application for a larger N (e.g. N > 30) should prove useful.
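The merging of the two versions and the identity with the Tchebycheff
system can be verified numerically. The sketch below uses a modified
Gram-Schmidt procedure with normalization to unit length; the author's own
orthogonalization procedure is not described in the text, so this choice,
like the function names, is an assumption.

```python
import math


def legendre(n, x):
    """P_n(x) from the recurrence (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}."""
    if n == 0:
        return 1.0
    p_prev, p = 1.0, x
    for k in range(1, n):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


def orthonormalize(columns):
    """Modified Gram-Schmidt with the plain dot product over the points."""
    basis = []
    for col in columns:
        v = list(col)
        for b in basis:
            proj = sum(vi * bi for vi, bi in zip(v, b))
            v = [vi - proj * bi for vi, bi in zip(v, b)]
        norm = math.sqrt(sum(vi * vi for vi in v))
        basis.append([vi / norm for vi in v])
    return basis


def discrete_legendre_columns(scale, max_order):
    """Values of P_0..P_max_order at x = (z - 4)/scale for z = 1..7."""
    xs = [(z - 4) / scale for z in range(1, 8)]
    return [[legendre(n, x) for x in xs] for n in range(max_order + 1)]


VERSION_1 = orthonormalize(discrete_legendre_columns(3.0, 4))
VERSION_2 = orthonormalize(discrete_legendre_columns(3.5, 4))

if __name__ == "__main__":
    print([round(v, 4) for v in VERSION_1[2]])
    print([round(v, 4) for v in VERSION_2[2]])
```

Under these assumptions the second order column is proportional to
$(5, 0, -3, -4, -3, 0, 5)$, the familiar 7-point orthogonal polynomial
values, and divided by $\sqrt{84}$ it gives entries such as 0.5455 and
0.3273.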


Table 7. Orthogonalized set of discrete Legendre polynomials
of Table 1.


0.5455 

•0. 

0 

0. 

■0.3273 

0. 

0.4365 

0 

0.3273 

-0. 

0 

0.3455 

-0. 

0. 

This statement is even more valid for N > 50 because most
table values of Tchebycheff polynomials discontinue after N = 50.
Since for larger N the integral in eqn. (3) can be replaced by
the summation sign with sufficient accuracy, and the Legendre
system becomes orthogonal again, the difficulties encountered for
few points disappear.

It was previously pointed out that the goal in curve fitting can also be
selected as an attempt to describe the variance of the function y
by a mathematical expression. If the match is perfect, the variances
$\sigma_y^2$ of the given data and $\sigma_{\hat{y}}^2$ of the analytical counterpart, are
identical. We can, therefore, mathematically formulate a criterion
of the success in curve fitting by defining a left variance

\[ \sigma_L^2 = \Sigma (y_i - \hat{y}_i)^2 / N. \]

The explained variance is

\[ \sigma_E^2 = \sigma_y^2 - \sigma_L^2, \]

\[ R = \sigma_E^2 / \sigma_y^2 = 1 - \sigma_L^2 / \sigma_y^2 \]

can be called reduction, and

\[ R_{\%} = R \cdot 100 \]

is then the percentage reduction.
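The three quantities just defined translate directly into code. In the
sketch below `fitted` stands for the values of the analytical counterpart
at the data points; the function names are hypothetical. A negative
percentage reduction simply means that the left variance exceeds the
variance of the data.

```python
def variance(values):
    """Variance of the data about their mean, with divisor N."""
    n = len(values)
    mean = sum(values) / n
    return sum((v - mean) ** 2 for v in values) / n


def left_variance(observed, fitted):
    """Mean squared difference between data and analytical counterpart."""
    n = len(observed)
    return sum((y - f) ** 2 for y, f in zip(observed, fitted)) / n


def percentage_reduction(observed, fitted):
    """100 * (1 - left variance / variance of the data)."""
    return 100.0 * (1.0 - left_variance(observed, fitted) / variance(observed))


if __name__ == "__main__":
    y = [1.0, 3.0, 2.0, 5.0, 4.0]
    mean = sum(y) / len(y)
    print(percentage_reduction(y, y))                # perfect match
    print(percentage_reduction(y, [mean] * len(y)))  # mean only
```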


As illustrated in detail (Essenwanger, 1979a) the left variance
can be written as


"t  ♦ (•„  - a*  + *„ 
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shore  9*  denotes  the  mine*  of  y,  and  y la  tb*  neaa,  aQ  tha 
^lfocaUl  coefficient  of  order  aero.  The  M denotes  the  euosntion 
of  the  elaneeta  In 


He  define 

-«af  ..{ej.  (ui 

o 

Thee  la  ea  orthogonal  ays ten  - -M#  end  ■ «J  ( see  Issenosnger, 

o 

1979s).  Fur  the  more,  aQ  ■ y end 

(1J*> 

In  a Don-orthogonal  systan  sqn. ( 15)  cannot  be  reduced  to  single  terns. 
Because  a ts  not  necessarily  y In  a non-orthogonal  systan,  we 
could  define 


^-oJ  + - y)* 


*»  - 1 * 


(17) 

(14c) 
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Ut  ui  iiium  chat  the  given  data  ara  tha  two  tai 


for  tha  Tchebycheff  polynomials  for  aavan  points . Thla  axaapla  has 
bean  aa lac tad  bacauaa  tha  coefficients  of  tha  Lagaadra  polynomials 
aarlaa  can  ha  calculated  by  Integration.  The  following  table  reaulta. 


Table 8. Given y and coefficients.


i 


+ iF ,+iP  + a P 

i * a a a : 


i P + a P , 
3 3 4 4 


la  identical  with  tha  data  yt  . 1 

0 and  al  - 160/7  - 22.857  where 


with  tha  coefflcienta  of  Table  8, 
Thu a va  have  a perfect  match. 

1 „a  - 6/7  and  - 22.0. 


7 0 0 
‘j,  3 * 

5.8%  and  a*  ■ 96.251  for  tha  Tchebycheff  aarlaa. 


i 

1 


The matrix (version 1) becomes for the six coefficients
$a_0$ through $a_5$:


e*  = 22.96  + 1.69  - 24.55  - 0 


is  confirmed. 

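The left variance of the individual term sequence is

$$\sum (y - a_0)^2 / N = 24.55 = 22.86 + 1.69$$

$$\sum (y - a_0 - a_1 P_1)^2 / N = 24.83 = 24.55 + 0.28$$

$$\sum (y - a_0 - a_1 P_1 - a_2 P_2)^2 / N = 24.83 + 18.08 = 42.91, \text{ etc.}$$

The remaining values of the sequence are 41.76 and 0.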

The corresponding numbers for version two are 23.36, 23.37,
26.83, 25.96, 0. It leads to the percentage reduction as displayed
in Table 9.


i 

1 , 

jj  Version  j 

fi  . i 

with 

> 1 1 

with  C2 

a 

1 y 

Version  with  of 

y 

with  8* 

y 


-2.2  j -0.1 


o.o  -o.i 


-79.1 

-73.6 


-151 


102.7 

170.1 


115.6 

111.1 


The positive reduction begins in both versions with the third order
term. Although the actual percentage contributions of the third and
fourth order terms are not completely identical with the numbers from
the Tchebycheff system, the important features run parallel; namely a
small contribution from a third order term and a considerable dominance
of the fourth order term. It may be further concluded that a
representation including only the three coefficients $a_0$ through $a_2$
is inadequate. In fact, the assumption of zero for these three
coefficients above would leave a smaller left variance than the actual
value (see matrix coefficients Table 4). For more details see
Essenwanger (1975a).


6. AN EXAMPLE FOR WIND PROFILE REPRESENTATION. Two wind profiles
at 2 km altitude level intervals were arbitrarily selected,
January 1, 1957 and 1958 at Montgomery. The following Table 10
exhibits the empirical data and the approximation by polynomials
up to the fifth order. Since the correct coefficients for Legendre
polynomials cannot be determined a priori, the effect of the
approximation cannot be directly shown. It may be inferred, however,
that the reconstructed curve from the Legendre polynomials should
have a smaller sum of the squared deviations from the analytical
data than for the Tchebycheff approximation. As can be readily
checked, however, both sums are about the same. This may be seen
as a confirmation of an earlier conclusion that for fewer than
about twenty points the advantage of the Legendre series over the
Tchebycheff series may not show up in practical work.

As an added feature, the percentage reduction is displayed.
No problems are apparent for the Tchebycheff series, whereas the
third order term in version 1 is negative, which demonstrates a
slight increase of the left variance on both dates. The percentage
reductions for the individual terms have been calculated by eqn. 14a.
While $S_y^2$ is the basis for the reduction in version 1, the
$a_0 = \bar{y}$ in version 2 and $S_y^2 = \sigma_y^2$.

Although  differences  in  the  percentage  reduction  between  the 
three  systems  exist,  the  numbers  are  equivalent  and  imply  the  same 
integrated effect. The second order term dominates considerably.
Besides  this  second  order  term,  a fourth  order  term  contributes  to 
the  1957  date  and  a fifth  order  term  for  1 January  1958.  The  other 
components  may  be  considered  to  have  minor  influence. 

7. CONCLUSION. As has been pointed out in the beginning, the
Legendre series is different from the Tchebycheff series in its
theoretical approach to curve fitting. Some difficulty arises when
the Legendre series is applied to a discrete function y(x). For a
small number of points (e.g. N < 50) the discrete Legendre
polynomials are not fully orthogonal and the coefficients cannot
readily be calculated from the regular coefficient formula by
replacing the integral by a summation sign. Some outlines for an
approximation are given, and more details can be found in
Essenwanger (1975a).

It was pointed out that in the sequence of this non-orthogonal
system for a small number of points the coefficients are not
independent, and the contribution to the left variance by the
individual term may become negative. Thus the contribution by the
individual order cannot be readily judged by customary methods for
an orthogonal system.
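The loss of orthogonality for few points can be checked numerically. The sketch below is an illustration only: it assumes equally spaced points on [-1, 1] including both end points (the paper's exact grid is not recoverable from this section), evaluates the Legendre polynomials by the standard three-term recurrence, and compares the discrete mean of P_0 P_2 for a small and a large number of points.

```python
def legendre(order, x):
    """Legendre polynomial P_order(x) from the three-term recurrence."""
    previous, current = 1.0, x
    if order == 0:
        return previous
    for k in range(1, order):
        previous, current = current, ((2 * k + 1) * x * current - k * previous) / (k + 1)
    return current


def discrete_inner_product(i, j, n_points):
    """Mean of P_i * P_j over n_points equally spaced points on [-1, 1]."""
    xs = [-1.0 + 2.0 * k / (n_points - 1) for k in range(n_points)]
    return sum(legendre(i, x) * legendre(j, x) for x in xs) / n_points


# The continuous integral of P_0 * P_2 over [-1, 1] is zero.
print(discrete_inner_product(0, 2, 7))     # clearly non-zero for 7 points
print(discrete_inner_product(0, 2, 1001))  # close to zero for many points
```

With seven points the cross term is far from zero, so the coefficients are coupled; with many points the sum approaches the integral and the system is again nearly orthogonal.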


Comparison of Representing a Wind Profile
by Tchebycheff and Legendre Polynomials.


a. Reconstructed Wind Profile (Montgomery).
As the Legendre polynomials for discrete points become simpler
to handle with increasing number of points, they should prove to be
a useful replacement for the Tchebycheff polynomials in solutions
of problems where the practical application of the Tchebycheff
polynomial method apparently shows a weakness such as for N > 50.

It is difficult to evaluate a priori whether the calculation
of coefficients via the covariance matrix is more cost effective
than the approximations of the "true" Legendre coefficients. It
is self-evident that the calculation of the covariance matrix adds
to the computer costs in the matrix solution while the major part
of the costs for the "Legendre coefficient" remains with the
approximation and iteration.
1.  INTRODUCTION

Fire Control Sensitivity Analysis (the effect of a change in
conditions on the aim of a weapon system) is normally accomplished on
high-speed computers because of the extensive calculation required.
These computers are, however, expensive to program and operate when
only a short or one-time program is needed. Since the purpose of the
analysis herein was for pre-test information there was no inherent need
for fast computations. Since a programmable calculator was available the
analysis was programmed on it.

It was soon apparent that the physical limitations of the memory
and computation speed demanded special techniques. The first of these
techniques was to minimize the number of calculations which needed full
precision. The second technique was to put the answers after each
run onto a cassette tape to free memory for the next run. The third
was to code the output answers, which have a large dynamic range, into
integers, thereby reducing the amount of cassette tape required. The
fourth technique was to write an iterative routine which automatically
varied the routines and controlled the output onto tape. This
technique allowed the unattended operation of the calculator.

The unattended operation of the calculator is the most significant
feature of the program. It allowed in this case 150 hours of operation
time during nights and weekends. The sole operator requirement was to
load and unload cassette tape at the end and beginning of the work-day.
The detail of the specific program shown herein is to illustrate the
very extensive calculations which can be performed even by small and
slow calculators. It must be noted that even more extensive programs
can be and have been implemented by daisy-chaining the output of one
program into the input of another program.
2.  CALCULATOR PROGRAM DESCRIPTION

Equipment: Hewlett-Packard 9830 with 3808 word memory

Program: Non-critical variables written in single (6-digit) or integer
(±2^15) precision notation. Memory capacity available only for a
single run. Output data scaled as integers and stored along with
the scaling factor on cassette tape. A typical program is included
as Section 10.
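Storing output as scaled integers together with one scaling factor can be sketched as follows. This is a modern illustration, not the calculator program of Section 10; the function names and the 32767 integer limit are assumptions chosen to mirror a 2^15 integer range.

```python
def encode_scaled(values, max_int=32767):
    """Map floats onto integers in [-max_int, max_int] plus one scale factor."""
    peak = max(abs(value) for value in values)
    scale = peak / max_int if peak > 0 else 1.0
    codes = [round(value / scale) for value in values]
    return scale, codes


def decode_scaled(scale, codes):
    """Recover approximate floats from the stored integers."""
    return [code * scale for code in codes]


run_output = [0.0012, -3.5, 1250.0]   # hypothetical values with a wide range
scale, codes = encode_scaled(run_output)
print(scale, codes)
print(decode_scaled(scale, codes))
```

The reconstruction error of any value is at most half the scale factor, so small values stored next to a large one lose relative precision. That is the trade-off accepted in exchange for the reduced tape requirement.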

Operating Conditions: Computation time of a single derivative was 30
seconds. Number of derivatives calculated was 11 instead of 21
because the rate of change of derivatives did not warrant more.
Computation was therefore approximately 5 minutes per run. (900 valid
runs were calculated for the analysis.) The program was typically
set up with indexing for 162 runs which required approximately 15
hours of computation. The program and data tape was normally
loaded at the end of the day and retrieved in the morning. The
only printed output generated was a single line listing the critical
information contained in each of the 162 files.

Output Processing: Output data from the program was stored on six
cassette tapes. A plotting routine read nine files and decoded them
into memory. The plotter was then used to generate plots as shown
in Section 9.

Comments:


1.  The program generates LOS angles which are independent of
ballistics and are better output separately.

2.  The operating and storage requirements are based on 21-point
runs instead of the 11 actually used.

3.  Changing the program to accommodate Comments 1 and 2 would
allow more runs to be in memory. This would reduce output
requirements considerably, tape requirements by 3 and files by 9.

4.  Comment 3 was not implemented because the savings did not
warrant programming multiple plotting routines for different output
formats.
3.  FIRE CONTROL SENSITIVITY ANALYSIS


1.  Weapon ballistic data is generally given with respect to the
weapon line of sight (LOS) coordinate system. Aircraft position is
given in terms of pitch, roll and heading.

2.  From the aircraft viewpoint, the conversion into the LOS
coordinates is straightforward because the LOS angles relative to the
airframe are known. From the sensitivity analysis viewpoint, however,
the LOS angles are unknown and must be calculated.

3.  A conversion routine from the aircraft data into the LOS system
was developed for this analysis. The conversion matrix and its
development are shown in Section 5 of this paper.

4.  The second major part of the program is the iterative solution of
the fire control equations in the LOS system. The fire control equations
and the iteration technique are shown in Section 4.

5.  The sensitivity (partial derivative) of the fire control solution
was then determined by changing a single parameter and determining the
change in the solution. It should be noted that if all 12 parameters
were used in this analysis technique, the output would approach infinity
(12 sets of 3 values each equal 531,441).

6.  Because of this, only pitch, roll and heading are changed for the
analysis of each value. The other parameters were either chosen at a
maximum (i.e. range) or were determined to have small linearized
responses (i.e. effects would be additive).
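The run count in item 5 and the one-parameter-at-a-time derivative can be illustrated with a short sketch. The `solve` function below is a hypothetical stand-in for the fire control solution, used only to show the mechanics; it is not the ballistic model of Section 4.

```python
def full_factorial_runs(n_parameters, n_levels):
    """Number of runs if every parameter takes every level in combination."""
    return n_levels ** n_parameters


def sensitivity(solve, parameters, name, delta):
    """Finite-difference estimate of d(solution)/d(parameter `name`)."""
    changed = dict(parameters)
    changed[name] += delta
    return (solve(changed) - solve(parameters)) / delta


def solve(p):
    """Toy stand-in for the fire control solution (assumption, illustration only)."""
    return 2.0 * p["pitch"] + p["roll"] ** 2


print(full_factorial_runs(12, 3))   # all 12 parameters at 3 levels each
print(full_factorial_runs(3, 3))    # pitch, roll and heading only
print(sensitivity(solve, {"pitch": 0.0, "roll": 30.0}, "pitch", 0.5))
```

Restricting the variation to pitch, roll and heading reduces the full-factorial count from 531,441 to 27 combinations per weapon and range.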

The parameters used were heading (0, 45, 135*), pitch (-20, 0, 10),
roll (0, 30, 60), range 40mm (1500M), 30mm (3000M), 7.62mm (2000M),
airspeed (100MPS), wind speed cross- and head- (0), yaw angle** (0),
altitude (0), rate of climb (0), air density (1.0), muzzle velocity
standard.

*75 for 40mm (weapon could not reach the target at right angle).

**Yaw angle is the angle between the aircraft heading and airspeed.


4.  BALLISTIC FIRE CONTROL EQUATIONS

®l  “ -[<%  + Gl0?  + a5  UB  1 + ^ ( T/Xs)  WL 

+ A8  (Bs  (U^  -W„)  - b„  (U^  -WS)]/V 

BM  ’ -K%  + UB  > + A7.  < T^S)/Wm 

+ ab  (BL  <uAs  - WB)  - bS(Val  -wl)]/v 

Tor  vac  - xs/  (uBs  + Uas  + <*5  csXs/uBs+  U^)) 

ΔT = TOF vac (ρ/ρs) V (A1 Xs + A2 Xs^2)

TOF = TOF vac + ΔT

These equations cannot be explicitly solved because the right-hand
expression contains terms dependent upon the value calculated. This
makes the solution iterative, with the previously calculated values
being used in the equation. Because of the nature of the equation, the
convergence to a specific value is extremely rapid. It can be shown
that the error due to truncating the iteration is less than the change
due to the last iteration.

Since there are two equations to be solved, the updating of values
was also done between the solution of the two equations.

Derivatives of the Sensitivity Program were calculated by changing
one of the parameters and calculating the change in the ballistic
equations. The convergence check in the program was set to allow errors
of less than 0.01 milliradian.
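The iteration scheme described above is an ordinary fixed-point iteration with a convergence check. A minimal sketch follows; the `update` function is a hypothetical contraction standing in for one pass through the ballistic equations, and the tolerance of 1e-5 radian corresponds to the 0.01 milliradian check.

```python
import math


def iterate_to_convergence(update, initial, tolerance=1e-5, max_iterations=50):
    """Repeat value = update(value) until successive values differ by < tolerance."""
    value = initial
    for iteration in range(1, max_iterations + 1):
        new_value = update(value)
        if abs(new_value - value) < tolerance:
            return new_value, iteration
        value = new_value
    raise RuntimeError("iteration did not converge")


def update(angle):
    """Toy right-hand side that depends weakly on the value being solved for."""
    return 0.05 + 0.1 * math.sin(angle)


angle, iterations = iterate_to_convergence(update, 0.0)
print(angle, iterations)
```

When the right-hand side depends only weakly on the unknown, as in this toy case, each pass shrinks the error by roughly a constant factor. The remaining error is then smaller than the last change, consistent with the truncation statement in the text.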


Definitions


B          Unit vector along barrel (or launcher)
G          Gravity vector
L          Unit vector in the direction of axis about which elevation
           of S is measured
M          Unit vector L x S (up is positive)
S          Unit vector in direction of launcher to target
S, M, L    Line of sight coordinate system
TOF        Time of flight
TOF vac    Time of flight in vacuum
U_A        Aircraft velocity
U_B        Projectile velocity relative to barrel
V          Projectile velocity relative to air
W          Windspeed
X          Position vector of target
ρ/ρs       Actual/standard air density


Ht  ■ K Tan  R Cos  P ■ Zp 
N = K Tan R Sin P

ΔY = N Cos ψ

ΔX = N Sin ψ

Because Z_R is in plane we further obtain:

Y = K Sin ψ - ΔY or K Sin ψ - K Tan R Sin P Cos ψ

X = K Cos ψ + ΔX or K Cos ψ + K Tan R Sin P Sin ψ

K - Cos  * Cos  P 

Sin*-  Tan  R Siu  P Cos  * 

1/K  - Tan*-  Tan  R Sin  P _ 

Circled items are computer program names
L - Sin*Cos  P - K (Cos*+  Tan  R Sin  P Sin*;  (1/K)(L)  - H3*  Sin*Cos  P + 
(Cosi|H-  Tan  R Sin  P Sin*)“  © 

3.  The Aircraft Unit Vector and points Z_R and Z_P describe a plane
which can be described by the angles A and B as shown.


We have by inspection:

Tan  A » (Zp  - Zr)/L 
Tan  B - (Zp  - Tan  Al)/y 

- (Sin  P - Tan  A Sin  * Cos  P)  / (Cos  P Cos*  ) 

Dividing top and bottom of Tan A by K we have:

A - AIM!  '(H3  Sin  ? - Tan  R Cos  P)/H4  - ® 

B - <©  - A Tan  '(Sin  P - Tan  H5  Sin  * Cob  P)/(Cos  PC  osi*  ) 

4.  Conversion from the vertical plane angles to the Line of Sight. Q5
is the angle from horizontal of the LOS to target.



H5  ■ Sin  Q5  - Tan  H6  Cos  Q5 

Tan  Cos_q5  jTan^HSjJC 

Tan  T 1 - Sin  2q5  + Sin  H6  Cos  Q5  Sin  Q5 


(Tan  H5) 

Tan  H6  Sin  l}5 


Apparent angle M = Atan [ Tan H5 / (Cos Q5 + Tan H6 Sin Q5) ]


Rotating by angle M around LOS 1 the positions of the unit vectors 1, 2
and 3 become:

1.  X = Cos Q5,  Y = 0,  Z = Sin Q5

2.  X = Sin M Sin Q5,  Y = Cos M,  Z = -Sin M Cos Q5

3.  X = -Sin Q5 Cos M,  Y = Sin M,  Z = Cos M Cos Q5

Which expressed in MATRIX FORM is:

     Cos Q5           0         Sin Q5

  Sin M Sin Q5      Cos M    -Sin M Cos Q5

 -Sin Q5 Cos M      Sin M     Cos M Cos Q5
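A quick numerical check of the conversion matrix is that its rows form an orthonormal set, as any rotation between coordinate systems must. The sketch below builds the matrix from the three unit vectors listed above; the function names and the test angles are arbitrary choices for illustration.

```python
import math


def los_matrix(q5, m):
    """Rows are unit vectors 1, 2 and 3 after rotation (angles in radians)."""
    return [
        [math.cos(q5), 0.0, math.sin(q5)],
        [math.sin(m) * math.sin(q5), math.cos(m), -math.sin(m) * math.cos(q5)],
        [-math.sin(q5) * math.cos(m), math.sin(m), math.cos(m) * math.cos(q5)],
    ]


def is_orthonormal(matrix, tolerance=1e-12):
    """True if every pair of rows has dot product 1 (same row) or 0 (different rows)."""
    for i in range(3):
        for j in range(3):
            dot = sum(matrix[i][k] * matrix[j][k] for k in range(3))
            expected = 1.0 if i == j else 0.0
            if abs(dot - expected) > tolerance:
                return False
    return True


print(is_orthonormal(los_matrix(math.radians(20.0), math.radians(35.0))))
```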
APPLICATIONS OF SEQUENTIAL SENSITIVITY TEST STRATEGIES
AND ESTIMATION USING A WEIBULL RESPONSE FUNCTION
FOR EXTREME PROBABILITIES AND PERCENTAGE POINTS


Gertrude Weintraub
Concepts and Effectiveness Division
Ammunition Development and Engineering Directorate
Picatinny Arsenal
Dover, New Jersey

1.  Abstract

Two applications of Dr. Einbinder's sequential sensitivity test
strategy and estimation methodology for reliability assessments are
discussed. Principal interest is in the determination of reliability
at extreme low or high probability of response regions with a minimum
number of tests. Empirical test data together with analysis and
interpretation of the results of analysis are presented and conclusions
drawn.


2.  Introduction 

Dr. S. Einbinder of Picatinny Arsenal devised a sequential sensitivity
test strategy and estimation methodology. This procedure appears to be
more efficient than other sensitivity methods in determining extreme
percentage points of a response function.

Applications of a One-Shot Transformed Response strategy and an Up
and Down Transformed Response strategy to empirical problems are discussed.
The basic characteristics of a sensitivity test are: a stimulus,
a test specimen and a response (0 or 1). Associated with each test is
a critical stimulus or strength such that if the stimulus exceeds the
strength, the specimen responds and vice-versa. The distribution of
strength is called the response distribution or the response function.

Based on quantal response data, we want to estimate the response function,
the extreme percentage points and the probability of response at a critical
level of the stress variable.

There exist several well known statistical techniques for treating
a quantal response, but the method in (1), which is the Weibull Sensitivity
Model and has been employed in the applications which follow, has a number
of advantages over the standard procedures. These advantages are the
following:

1)  Robust to unknown true response distribution.

2)  Minimizes the need for variable transformations.

3)  Capable of assuming a wide variety of distribution shapes which
allows the approximation of many response curves, including the normal,
over local regions and over the entire domain of the response function.

The main disadvantage of the Weibull sensitivity model is that it is a 3
parameter distribution, and the location parameter is sometimes difficult
to estimate.

Two of the better known and frequently used sequential sensitivity
test methods are the Up and Down Test (2) and the Langlie One Shot Test
Strategy (3). For the Up and Down Test one item is tested at a time
starting at the best initial estimate of the 50% response point. The
test level is moved up one step after each negative response and down
one step after each positive response. The step size is fixed and must
be determined in advance of the test. This method tends to concentrate
the observations near the mean of the distribution. As a result, the
method is generally good in estimating the mean or 50% point of a
symmetric distribution but does not do too well with extreme percentage
points.
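The Up and Down rule can be sketched in a few lines. The specimen strengths below are hypothetical, and the sketch assumes that a specimen responds when the stimulus exceeds its strength, as stated earlier in this section.

```python
def up_and_down(start_level, step, strengths):
    """Run an Up and Down test; return the levels used and the 0/1 responses."""
    level = start_level
    levels, responses = [], []
    for strength in strengths:
        responded = level > strength            # stimulus exceeds strength
        levels.append(level)
        responses.append(1 if responded else 0)
        # Down one step after a response, up one step after a non-response.
        level = level - step if responded else level + step
    return levels, responses


levels, responses = up_and_down(8.0, 1.0, [10.0] * 6)
print(levels)
print(responses)
```

With identical strengths the levels quickly oscillate around the strength value, which shows why the observations pile up near the 50% point and say little about the tails.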

Langlie developed a sequential test strategy that overcame the
difficulty with the Up and Down method in predetermining the step size.
This strategy makes use of continuously variable stress levels, is
insensitive to the starting level and does not require specifying a
prior step size. The analysis is based on a normal response distribution
and has been shown to be more efficient than the Up and Down in estimating
the mean and standard deviation of the response distribution.

Often, however, the experimenter is interested in the response function
at the extreme ends of the distribution. This generally requires data
to be obtained from the local region of interest.

Wetherill (4), in 1963, published the results of an investigation of
sequential test methods for the estimation of general percentage points
of a quantal response function. He found available procedures like the
Up and Down to be unsuitable for estimation of extreme percentage points.
He proposed a rule for transforming the response in an Up and Down Test
so that observations would be concentrated in the tail areas. In Dr.
Einbinder's test strategy, the Wetherill transformation is applied to
the Langlie One-Shot test algorithm. This procedure is referred to as
the One-Shot Transformed Response Strategy (OSTR for short).
The applications I shall describe feature an example of the Up and
Down Transformed Response Strategy (UDTR for short) and the Langlie
One-Shot Transformed Response Strategy. Estimation of the extreme
percentage points is accomplished using a Weibull distribution as a
response function. The basic rationale for the new test strategy and
estimation methodology includes testing in or close to the region of
interest, using a variable level strategy, using a sequential strategy,
and using a locally best approximation if the response model is not known.

The new test strategy involves a transformation procedure which
is defined in Tables 1 and 2. The transformation is defined by the
value of No which determines the response quantile around which the test
levels tend to concentrate. This quantile is called the transformed
median percentage (TMP). For No = 3, the TMP = 79.32%. The response
transformation is designed to make an increase in stress easier than a
decrease. The greater the difficulty in decreasing the stress level, the
greater will be the transformed median percentage. For P > .5 as shown in
Table 1, a positive response is denoted by an X or 1 and a negative
response by 0. A type D response which requires a reduction in stress
level is allowed to occur after No confirmations of a positive response.
For P < .5 the U's and D's are redefined as shown in Table 2.
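The link between No and the transformed median percentage can be sketched with the usual equilibrium argument for a transformed Up and Down rule: the test level settles where a D and a U are equally likely, so that p^No = 0.5. That relation is an assumption here, since Tables 1 and 2 are not reproduced in this section, and the function name is hypothetical.

```python
def transformed_median_percentage(n_confirmations):
    """Percentage point p (in %) satisfying p ** n_confirmations = 0.5."""
    return 100.0 * 0.5 ** (1.0 / n_confirmations)


for no in (1, 3, 14):
    upper = transformed_median_percentage(no)
    # For P < .5 the roles of U and D are mirrored, giving the lower-tail point.
    print(no, round(upper, 2), round(100.0 - upper, 2))
```

Under this relation No = 1 gives the ordinary 50% point, No = 3 gives a point near 79%, and No = 14 mirrored into the lower tail gives a point near 5%.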

A change of response type is said to occur when an alternation of
response occurs. Wetherill proposed a stopping rule based upon a
specified number of changes of response type rather than on a fixed
number of trials. Based upon Wetherill's results, and our experience with
this strategy, a minimum of 5-6 changes of response is advocated. The
number of observations required in an experiment is a random variable with
this stopping rule. The expected sample size for a particular number
of changes of response increases with No, that is, the farther out in the
tails of the response curve testing takes place.

3.  Discussion 

Next,  we  describe  two  actual  applications  of  this  new  sensitivity 
test  strategy. 

3.1  Objective 

During  the  process  of  acceptance  testing  of  an  artillery  fuze,  it  was 
found  that  the  fuze  armed  at  a distance  of  10  feet  from  the  gun  muzzle. 

This  condition  was  unsatisfactory  since  the  fuze  specifications  required 
that  no  fuze  arming  occur  at  a distance  of  10  feet  from  the  muzzle. 

A test  program  was  subsequently  undertaken  to  examine  the  fuze  arming 
distance  distribution  in  the  lower  tail  in  order  to  determine  the  following: 

a.  If the fuze specification acceptance test criterion for arming
was reasonable for the fuze design.

b.  If  not,  to  decide  on  a suitable  alteration  which  would  provide 
the  desired  quality  control  on  safe  arming. 

3.2  Recommended Test Plan

In  order  to  accomplish  the  desired  objective,  a statistical  test  program 
was  designed,  tailored  to  the  new  sequential  sensitivity  test  strategy. 

The objective was to determine the distribution of distances at which
fuze arming occurs or conversely the distribution of target ranges at
which fuze functioning does not occur. Primary interest was in ascertaining
a safe gun-to-target distance which involved finding a distance at which
a small probability of functioning would be expected to occur.
A One-Shot Transformed Response Sequential Sensitivity Test plan
using an No = 14 was selected and implemented. Since fuze safety was the
principal problem we were interested in defining the lower tail of the
response distribution as accurately as possible within the limitations
of time, hardware and cost. Using an No = 14 response strategy tended to
concentrate the tests in the neighborhood of the lower 5% region of fuze
arming. Simulations previously conducted to estimate the required
sample size indicated that about 150 tests would probably be needed to
obtain 6 changes of response. Fuze function at a given range was defined
as a positive response. One (1) represents fuze function and zero (0)
is non-arming or non-function. The response must be defined such that
an increase in stress level results in increasing the probability of a
response. Then to obtain a type U response, we have to observe 14 tests
conducted at a given target distance without arming, i.e., 14 zero
responses before increasing the range to the target. If a fuze function
was obtained at a given stress level or target range before a sequence
of 14 zeros was completed, then a type D response is said to have occurred
which required a decrease in target range.

Test limits were set at 0 and 100 feet from the gun. The first test
level was set at 50 feet from the gun and testing continued until a fuze
function occurred. The response was classified as type U or D according
to the criteria described above. Testing was continued by setting each
subsequent level of test halfway between a D and U response. If such an
alternation did not occur, the procedure consisted of going back in the
sequence of outcomes until an equivalent number of D's and U's were found.
The next test level is the average of the stress levels corresponding
to these outcomes. Where U's and D's could not be averaged (i.e., where
an equivalent number of D's and U's were not obtained) subsequent
levels were averaged by using the lower limit for a type D response
and the upper limit for a type U response. Testing continued until
all of the 150 rounds were tested. These results are shown in Table 3.
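The level-setting rule of this paragraph can be sketched as follows. This is one reading of the rule, not the original test software: the function names are hypothetical, and the limits of 0 and 100 feet and the 50 foot first level are taken from the text.

```python
def next_test_level(levels, types, lower_limit=0.0, upper_limit=100.0):
    """Next level from the history of levels and their 'U'/'D' outcomes."""
    current = levels[-1]
    ups = downs = 0
    # Walk back through the outcomes until U's and D's balance.
    for level, kind in zip(reversed(levels), reversed(types)):
        if kind == "U":
            ups += 1
        else:
            downs += 1
        if ups == downs:
            return (current + level) / 2.0
    # No balance found: average with the limit in the direction of travel.
    limit = upper_limit if types[-1] == "U" else lower_limit
    return (current + limit) / 2.0


def run_sequence(outcome_types, first_level=50.0):
    """Levels visited for a given sequence of type U / type D outcomes."""
    levels = [first_level]
    for index in range(len(outcome_types)):
        levels.append(next_test_level(levels[: index + 1], outcome_types[: index + 1]))
    return levels


print(run_sequence(list("DUDDDUUUD")))
```

For the outcome sequence D, U, D, D, D, U, U, U, D the sketch gives 50, 25, 37.5, 31.25, 15.625, 7.8125, 11.719, 21.484, 29.492 and 25.488 feet. Later levels in an actual test may differ if the limits or the averaging were handled differently.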

Analysis of test data from the 150 rounds showed the point estimate
of probability of arming to be .002 at 10 feet and the upper 95% confidence
level of probability to be .010. With these probabilities of arming in
mind, the fuze engineers decided to test fire an additional 40 rounds at
10 feet, hoping to get no arming. Much to their dismay, 2 rounds out of
40 tested at 10 feet were found to arm. This result is not considered
to be inconsistent with the previous performance estimates resulting
from analysis of the test data from the 150 rounds. Thus, if the
probability of functioning is .010 as estimated from the 150 rounds, the
probability of observing 2 functions out of 40 is .060.
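The quoted probability can be checked with a binomial calculation. The sketch assumes independent rounds with a constant functioning probability of .010. Whether the quoted figure refers to exactly two or to at least two functions is not stated, so both are computed.

```python
from math import comb


def binomial_pmf(k, n, p):
    """Probability of exactly k functions in n independent rounds."""
    return comb(n, k) * p ** k * (1.0 - p) ** (n - k)


def prob_at_least(k, n, p):
    """Probability of k or more functions in n independent rounds."""
    return 1.0 - sum(binomial_pmf(i, n, p) for i in range(k))


print(round(binomial_pmf(2, 40, 0.010), 3))   # exactly 2 of 40
print(round(prob_at_least(2, 40, 0.010), 3))  # 2 or more of 40
```

The probability of two or more functions comes out close to the quoted value, which supports the statement that the result is not inconsistent with the earlier estimate.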

Test data from the 40 rounds were subsequently aggregated with the
other 150 data values and revised probabilities were obtained. The
probability of arming at 10 feet was estimated to be .015 as a point
estimate and the upper 95% confidence level of probability
was estimated to be .037.

The following conclusions were drawn from the analysis:

1)  The current fuze design cannot meet the Mil-Standard
non-arming requirement at 10 feet with any high degree of reliability.

2)  Either the arming distance acceptance test requirement
has to be changed or the design has to be modified to accommodate a
10 foot arming distance characteristic.
TABLE 3


JEFFERSON PROVING GROUND


M503 FUZE TEST RESULTS - April 1974


        STIMULUS               RESPONSE   NUMBER OF
   I      (ft)      RESPONSE     TYPE      CHANGES

   1     50.0000       0
   2     50.0000       1           D
   3     25.0000       0
   4     25.0000       0
   5     25.0000       0
   6     25.0000       0
   7     25.0000       0
   8     25.0000       0
   9     25.0000       0
  10     25.0000       0
  11     25.0000       0
  12     25.0000       0
  13     25.0000       0
  14     25.0000       0
  15     25.0000       0
  16     25.0000       0           U          1
  17     37.5000       1           D          2
  18     31.2500       0
  19     31.2500       0
  20     31.2500       0
  21     31.2500       0
  22     31.2500       0
  23     31.2500       0
  24     31.2500       0
  25     31.2500       1           D
  26     15.6250       0
  27     15.6250       0
  28     15.6250       0
  29     15.6250       0
  30     15.6250       0
  31     15.6250       0
  32     15.6250       0
  33     15.6250       0
  34     15.6250       0
  35     15.6250       0
  36     15.6250       0
  37     15.6250       1           D
  38      7.8130       0
  39      7.8130       0
  40      7.8130       0


1 = Function
0 = Non-Function

U = 14 (0's)

D = 13 (0's), 1 etc.
TABLE 3 (CONT'D)

M503 FUZE TEST RESULTS - April 1974 (Continued)


        STIMULUS               RESPONSE   NUMBER OF
   I      (ft)      RESPONSE     TYPE      CHANGES

  41      7.8130       0
  42      7.8130       0
  43      7.8130       0
  44      7.8130       0
  45      7.8130       0
  46      7.8130       0
  47      7.8130       0
  48      7.8130       0
  49      7.8130       0
  50      7.8130       0
  51      7.8130       0           U          3
  52     11.7190       0
  53     11.7190       0
  54     11.7190       0
  55     11.7190       0
  56     11.7190       0
  57     11.7190       0
  58     11.7190       0
  59     11.7190       0
  60     11.7190       0
  61     11.7190       0
  62     11.7190       0
  63     11.7190       0
  64     11.7190       0
  65     11.7190       0           U
  66     21.4850       0
  67     21.4850       0
  68     21.4850       0
  69     21.4850       0
  70     21.4850       0
  71     21.4850       0
  72     21.4850       0
  73     21.4850       0
  74     21.4850       0
  75     21.4850       0
  76     21.4850       0
  77     21.4850       0
  78     21.4850       0
  79     21.4850       0           U
  80     29.4930       0
TABLE 3 (CONT'D)

M503 FUZE TEST RESULTS - April 1974 (Continued)


        STIMULUS               RESPONSE   NUMBER OF
   I      (ft)      RESPONSE     TYPE      CHANGES

  81     29.4930       1           D          4
  82     25.4890       0
  83     25.4890       0
  84     25.4890       0
  85     25.4890       0
  86     25.4890       0
  87     25.4890       0
  88     25.4890       0
  89     25.4890       0
  90     25.4890       0
  91     25.4890       0
  92     25.4890       0
  93     25.4890       0
  94     25.4890       0
  95     25.4890       0           U          5
  96     27.4950       0
  97     27.4950       0
  98     27.4950       0
  99     27.4950       0
 100     27.4950       0
 101     27.4950       0
 102     27.4950       0
 103     27.4950       0
 104     27.4950       0
 105     27.4950       0
 106     27.4950       0
 107     27.4950       0
 108     27.4950       0
 109     27.4950       0           U
 110     38.7980       1           D          6
 111     33.1470       0
 112     33.1470       1           D
 113     29.3180       0
 114     29.3180       0
 115     29.3180       1           D
 116     20.5190       0
 117     20.5190       0
 118     20.5190       0
 119     20.5190       0
 120     20.5190       0
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jiff  mow  pm  vac  qwoomp 

JEFFERSON PROVING GROUND
M503 FUZE TEST RESULTS - April 1974 (Continued)

  I      STIMULUS (ft)    RESPONSE

 121       20.5190           0
 122       20.5190           0
 123       20.5190           0
 124       20.5190           0
 125       20.5190           0
 126       20.5190           0
 127       20.5190           0
 128       20.5190           0
 129       20.5190           0
 130       24.9190           1
 131       22.7190           0
 132       22.7190           0
 133       22.7190           0
 134       22.7190           0
 135       22.7190           0
 136       22.7190           0
 137       22.7190           0
 138       22.7190           0
 139       22.7190           0
 140       22.7190           0
 141       22.7190           0
 142       22.7190           0
 143       22.7190           0
 144       22.7190           0
 145       23.8190           0
 146       23.8190           0
 147       23.8190           0
 148       23.8190           0
 149       23.8190           0
 150       23.8190           0
 151       10.0000           0
 152       10.0000           0
 153       10.0000           0
 154       10.0000           0
 155       10.0000           1
 156       10.0000           0
 157       10.0000           0
 158       10.0000           0
 159       10.0000           0
 160       10.0000           0

RESPONSE TYPE (row positions lost): U, D
NUMBER OF CHANGES (row positions lost): 7, 8


TABLE 3 (CONT'D)

JEFFERSON PROVING GROUND
M503 FUZE TEST RESULTS - April 1974 (Continued)

  I      STIMULUS (ft)    RESPONSE

 161       10.0000           0
 162       10.0000           0
 163       10.0000           0
 164       10.0000           0
 165       10.0000           0
 166       10.0000           0
 167       10.0000           0
 168       10.0000           0
 169       10.0000           0
 170       10.0000           0
 171       10.0000           0
 172       10.0000           0
 173       10.0000           0
 174       10.0000           0
 175       10.0000           0
 176       10.0000           0
 177       10.0000           0
 178       10.0000           0
 179       10.0000           0
 180       10.0000           0
 181       10.0000           1
 182       10.0000           0
 183       10.0000           0
 184       10.0000           0
 185       10.0000           0
 186       10.0000           0
 187       10.0000           0
 188       10.0000           0
 189       10.0000           0
 190       10.0000           0

3) Without having conducted the fuze tests in the prescribed
sequential manner, the arming response distribution at the lower end
could not have been determined with the same precision using the limited
sample size.

In the next application of sensitivity testing, the objective was
to determine the minimal quantity of propellant charge required to eject
a projectile from a gun tube. An Up and Down Transformed Response (UDTR)
procedure was utilized, since a limited number of projectiles were
available for test, and it was impractical to vary the test levels in a
continuous manner.

A sequential sensitivity test program was conducted by varying the
levels of propellant charge volume for low zone firing. Interest was
focused on predicting the probability of projectile sticking when a
complete low zone propellant charge is employed.

In the loading of a projectile into a gun tube, the projectile is
rammed into the tube, after which a propellant charge is employed to
eject the projectile from the gun tube. When the propellant charge is
insufficient to expel the projectile, the latter sticks in the gun tube,
causing an unsafe and undesirable condition.

Our problem was to evaluate the probability of sticking for a
standard projectile and a modified version of the standard projectile
when a complete low zone propellant charge is used. The standard
projectile served as a baseline for comparing the new projectile.

3.3 Test Plan

A sequential sensitivity test plan was designed to vary propellant
charge volume by a delta of 10 oz. starting at approximately 1/2 low
zone propellant charge volume.

test procedure was implemented. The response strategy of N = 4 for 6
changes of response, requiring approximately 30 rounds, was utilized in
the interest of expediency and limitations on hardware. A type D response
consisted of the outcomes (1111) while a type U response consisted of
(1110), (110), (10), or (0), where 0 = a sticker and 1 = a non-sticker.
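
The grouping rule just described can be sketched in a few lines. This is only an illustration under stated assumptions, not the test program used in the study: the function names `classify_groups` and `count_changes` and the sample outcome sequence are hypothetical.

```python
def classify_groups(outcomes, n=4):
    """Split a sequence of 0/1 outcomes into UDTR response groups.

    A group closes as type "D" after n consecutive 1s (non-stickers)
    and as type "U" as soon as a 0 (sticker) is observed.
    Returns the closed groups and any unfinished trailing outcomes.
    """
    groups = []
    current = []
    for outcome in outcomes:
        current.append(outcome)
        if outcome == 0:
            groups.append(("U", tuple(current)))
            current = []
        elif len(current) == n:
            groups.append(("D", tuple(current)))
            current = []
    return groups, tuple(current)


def count_changes(groups):
    """Count how often the response type switches between consecutive groups."""
    types = [kind for kind, _ in groups]
    return sum(1 for a, b in zip(types, types[1:]) if a != b)


# Hypothetical outcome sequence (1 = non-sticker, 0 = sticker).
sample = [1, 1, 1, 1, 1, 1, 0, 0, 1, 1, 1, 1]
groups, leftover = classify_groups(sample)
print(groups)
print(count_changes(groups))
```

Under the stopping rule quoted above, testing would end once `count_changes` reaches 6.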

Tables 4 & 5 show actual test results obtained during the test program
from tests conducted on projectiles 1 and 2. Tests on projectile 1 were
conducted in accordance with an N = 4 strategy and 6 changes of response
stopping rule. The delta used was 5 oz. instead of the originally intended
10 oz., and close to the end of the test program the delta was reduced to
2.5 oz. in order to obtain an overlap region of test results (e.g., sticker
and non-sticker). Tests on projectile 2 did not conform to the prepared
test strategy but rather to an inverse sampling procedure (where several
tests were conducted with a given charge volume before decreasing charge
volume for subsequent tests). The test data resulting from these tests
consisted of quantal responses which were amenable to analysis using our
Weibull model.

Results of analysis of test results from projectiles 1 and 2 are
shown in Figure 5. The curves show the 90th percentile of non-stick
to be 32 oz. or 38% of full charge volume for projectile 2 vs. 58 oz.
or 68% of full charge volume for projectile 1 (full charge was 85 oz.).

The modified design (Projectile 1) showed a greater propensity to sticking
at less than full charge volume. However, extrapolating the estimated
response functions to full charge volume, the probability of sticking at
full charge for each of the projectiles was estimated to be 1 in a
million. Clearly, to verify such a small probability would require an
extremely large number of tests at full charge (in the order of 22,000
tests with no stickers in order to validate a probability of stick of 1
in 10,000).
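
The order of magnitude quoted above can be checked with the usual zero-failure argument: n sticker-free tests demonstrate a sticking probability of at most p when (1 - p)^n does not exceed 1 minus the confidence level. The sketch below is an assumption-laden illustration; the paper does not state the confidence level behind its figure, so the 90% used here is hypothetical, as is the function name `zero_failure_tests`.

```python
import math


def zero_failure_tests(p_fail, confidence):
    """Smallest n such that n failure-free trials demonstrate a failure
    probability of at most p_fail at the given one-sided confidence level.

    Solves (1 - p_fail) ** n <= 1 - confidence for n.
    """
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - p_fail))


# Assumed 90% confidence; the source does not state the level it used.
n_required = zero_failure_tests(1e-4, 0.90)
print(n_required)
```

With these assumptions the result is roughly 23,000 tests, the same order as the figure given in the text.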

4. Summary and Conclusion

The two applications referred to herein represent actual examples
of the successful implementation of the Wetherill (UDTR) and Einbinder
(OSTR) sequential sensitivity test procedures and the Weibull response
model. The OSTR procedure has also been applied to evaluate fuze detonator
safety by determining the distribution of out-of-line distances for
non-propagation of the explosive train.

Other applications of the test procedure have been implemented. For
example, it has recently been used for estimating ballistic limit
distributions of penetrators. In this connection, the procedure was
used to estimate hazard velocity levels for plastic fragments in terms
of perforating 1 cm. gelatin blocks.

In summary, similar sequential sensitivity test programs have been
used rather successfully in quantitatively assessing the effect of
environmental treatments and design changes on munition functioning and
safety. In each instance, the effect of a single variable is assessed
by allowing that variable to vary by discrete levels and obtaining
responses at each level of test.

Particularly has this new method been helpful in estimating the
response function locally over some low or high region of interest. It
also affords estimates of percentage points of the response distribution
and probabilities of response at specified levels of a stress. Our
computer program produces point estimates and confidence level estimates
of reliability and percentage points.

STATISTICAL ANALYSIS AND MODELING OF SENSITIVITY AUGMENTATION
IN CUTANEOUS COMMUNICATIONS


R. D'Accardi and H. S. Bennett


US  Army  Electronics  Command,  Fort  Monmouth,  New  Jersey 


ABSTRACT. Since about 1965, the US Army Electronics Command has supported
several investigations dealing with a new method of communications by which
information transfer takes place through excitation of the peripheral
nerve endings located in the dermis. The system involves skin stimulation
by an electrical pulse transmitted in a Morse code-like pattern. Two small
electrode pins mounted in a plastic holder attached to the subject's forearm
provided the signal mechanism. One type of data was obtained when a cutaneous
"cuing" signal was used as a precursor in the standard Fairbanks Rhyme audio
test in order to test its ability to increase aural acuity.

The object of this presentation is threefold. First, we discuss the design
of the experiment, classification of subjects, and techniques for sensitivity
augmentation (electrical excitation of the peripheral nerve endings in the
dermis). Secondly, we present statistical estimates of (a) the effects of the
controlled variables (i.e., level of awareness, and audio noise level) upon
response, and (b) the independence of these estimated effects. Thirdly, we
present a realistic two-dimensional characterization of aural acuity with
cutaneous "cuing," over a range of values, for a prescribed level of confidence,
within which valid values of the model parameters may be found.

Among other things, cutaneous communication is intended to strengthen and
enhance present audio and visual electronic communications by "alerting"
communicators in a tactical environment to the fact that a standard audio or
visual message will follow. In this regard, the information obtained from
the statistical analysis of the effects of "cuing" should provide useful
information on man-machine-interface characteristics for future design of
such "cued" communication systems.

INTRODUCTION. The general goal of cutaneous communications is to provide
an effective means of improving and supplementing present tactical
communications, especially in a noisy environment. A system of this nature
could be used for:

1. Warning signals

2. Alerting or cuing signals

3. Coded message traffic

4. Priority one-way communication

It can be applied to sentry operations, Airborne or Airmobile operations,
Army Aviation support, small unit Reconnaissance, and Armored Operations
where noise and distractions from a tactical environment may adversely affect
the performance of radio operators and comcenter personnel. In rapidly
developing situations and reconnaissance where privacy, radio silence, or
back-up communications links are demanded, pre-coded messages can be used
in lieu of normal radio means for command and control of small units or
individual soldiers. Cuing signals used as a precursor to standard message
formats can be of use to armor and helicopter operations which by their very
nature are "noisy". Communications personnel can, therefore, be alerted to
message traffic, which can save time and decrease the probability of operator
error.

In developing the concept for cutaneous communication, several problems
had to be resolved.* Work by Bennett, Hennessy and McCray (1) determined
these parameters through a series of pilot experiments which were designed to
determine the effects on sensitivity levels when electrode configurations,
pulse width, pulse frequency, and pulse type were varied. Other experiments
were conducted to (a) determine if one could detect individual pulses of various
word rates; (b) determine effects of different metals on threshold feeling
and discomfort; (c) determine the use of "shadow signals" to enhance individual
acuity; and (d) determine, optimally, the lowest possible signal power to
produce sensation on the skin. The results of their work are well documented (2, 3)
and they show that the optimal signal, which proved the most efficient both from
the viewpoint of minimization of power and acuity of response, was a bi-phasic
rectangular pulse of 0.25 msec duration applied at a 300 pps rate, using
a pin-type electrode of surgical steel with constant current in the 0.33 to
0.5 mA range at 10 to 30 volts excitation. With this information, a series
of experiments was conducted using a cutaneous "cuing" signal as a precursor
signal to the auditory Fairbanks Rhyme Test. These experiments were concerned
with the effects of two variables (level of awareness, and acoustic noise)
at several levels. The level of awareness (cuing signal) was compared with
the recognition of a random selection of phonemes both in the presence of
masking electrical noise, and in a noise-free environment. It remained to
determine whether or not "cuing" improved the accuracy of phoneme recognition
in the presence of noise. This paper is concerned with the analysis and
modeling of these effects.


*In order to be "optimal" such a system had to excite the double layer of
Na+ and K+ ions which surround the nerves in the dermis. This means that
parameters such as electrode configuration, optimal power requirements,
pulse shape, and pulse duration have to be determined before the system
becomes a reality.

DESIGN OF THE EXPERIMENT. The experiment was conducted at Fort Monmouth,
New Jersey, where the standard Fairbanks Rhyme test was administered to eight
subjects. The test consisted of a random selection of voiced and unvoiced
stops, fricatives, and liquids and semi-vowels. There were two familiarization
sessions and eight data acquisition sessions taken over a five-day period.

All subjects were screened for normality of binaural hearing and for other
medical requirements as prescribed by the staff at Patterson Army Hospital.

The familiarization sessions consisted of electrocutaneous cuing and auditory
reception and transcription of 250 phonemes. Data acquisition sessions
consisted of the same process accompanied by several treatments or levels of
"excitation" and "noise". The first familiarization session's conditions were
equivalent to a "no-noise" environment. The second was a "noisy" environment.

For each of the ten sessions, the cutaneous sensation threshold level (CSTL)
of each subject was determined prior to testing. After each page of 50 phonemes,
the threshold was rechecked and reset if necessary. Thus, for each session of
250 phonemes, for each subject, there are five distinct threshold measurements
from which the level of the variable, %CSTL, was calculated.


The Fairbanks Rhyme test was "taped" and administered to simulate both the
"no-noise" and "noisy" environments. Therefore, in the preliminary analysis
of the data, we were concerned with the effects of two variables at two
levels, where the combinations of two levels of CSTL and noise are compared
using the correct phoneme recognition as the joint response variable for all
subjects. The experiment was well suited for the 2^2 factorial design with
replication. A possible model for this randomized design is:


$Y_{ij} = \mu + A_i + B_j + AB_{ij} + E_{ij}$   (1)

where:  $A_i$ = CSTL (cuing factor at 0% CSTL and at 125% CSTL),

        $B_j$ = Environmental factor (no-noise and noisy),

        $AB_{ij}$ = Interaction of main effects,

        $E_{ij}$ = Experimental error, and

        $Y_{ij}$ = Correct phoneme response for all subjects.

The subjects were divided into two groups of four each. Each group was
prepared with one familiarization session, and tested with four sessions
of 250 phonemes. At the "0" noise and -5 dB noise levels, information was
recorded once per session, giving a replication of four observations. The
results herein pertain to the group II subjects.


ANALYSIS OF VARIANCE. The group II data was used to determine if phoneme
recognition is improved in the presence of environmental (acoustic) noise with
cutaneous cuing. As previously mentioned, two levels of CSTL were chosen,
i.e., 0% CSTL and 125% CSTL, to simulate cue and no-cue conditions. Likewise,
two levels of environment were defined, i.e., two S/N ratios corresponding to
no-noise and noisy conditions respectively. The response variable is
correct phoneme response, i.e., recognizing the phoneme B, given B as the
stimulus, etc. The model chosen for this randomized design is:


$Y_{ij} = \mu + A_i + B_j + AB_{ij} + E_{ij}$

where the variables are defined as in equation (1) above. The effect of each
factor is defined as the change of response variable produced by either a
change in the levels of $A_i$, $B_j$, or both.

Table  1 shows  the  treatment  combinations  and  the  associated  measured  responses. 
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Table  1 - Treatment  Combinations 


The responses have been normalized to indicate the percentage of correct
phoneme response to the nearest per cent. From this data, the following
ANOVA was calculated:

Source          Degrees of    Sum of      Mean        F
                Freedom       Squares     Square      Ratio

CSTL                1           25.0       25.0       0.84
Environment         1          225.0      225.0       7.58
Interactions        1           81.0       81.0       2.73
Error              12          356.0       29.67
Total              15          687.0

From this information at the 95% level of significance, using
$F_{1,12}(.95) = 4.75$ from the standard F distribution, it is obvious that
environment is significant and has a strong effect on correct phoneme
response. From the interactions, it is apparent that the various
combinations of environment and cuing are not significant. This is interpreted
as an indication that at least a 5 dB improvement in effective S/N ratio is
realized. In other words, performance remains essentially the same in a
noisy environment with and without cuing, whereas with no-cuing there is
a significant deterioration in performance observed when going from a
no-noise to a noisy environment.
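
The F ratios and the significance calls above follow directly from the tabulated sums of squares. The short check below is a sketch only: the sums of squares, degrees of freedom, and the critical value 4.75 are taken from the text, while the variable names are arbitrary.

```python
# Sums of squares and degrees of freedom as read from the ANOVA table.
anova = {
    "CSTL": (25.0, 1),
    "Environment": (225.0, 1),
    "Interactions": (81.0, 1),
}
ss_error, df_error = 356.0, 12
f_critical = 4.75  # F(1, 12) at the 0.95 level, as quoted in the text

# Each F ratio is the effect mean square divided by the error mean square.
ms_error = ss_error / df_error
f_ratios = {name: (ss / df) / ms_error for name, (ss, df) in anova.items()}
significant = {name: f > f_critical for name, f in f_ratios.items()}

for name, f in f_ratios.items():
    print(f"{name:12s} F = {f:5.2f}  significant: {significant[name]}")
```

Only the environment effect exceeds the critical value, in agreement with the conclusion drawn in the text.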


REGRESSION MODEL. In an effort to arrive at a realistic two-dimensional
regression model which would describe any possible sub-liminal effects, i.e.,
changes in performance at, say, 50% and 75% CSTL, the following linear model was
considered:


$Y_t = B_0 + B_1 X_{1t} + B_2 X_{2t} + B_3 X_{1t} X_{2t} + E_t$


where  $Y_t$ = correct phoneme response,
       $X_{1t}$ = level of CSTL,
       $X_{2t}$ = level of environmental noise.

The level $X_{2t}$ was designated either 0 or 1 to correspond to low and high
levels of noise, and the $X_{1t}$ levels are 0.00, 0.75, 1.00, and 1.25 respectively.

The purpose of this model was to establish a mathematical relationship to
describe the effect of varying CSTL in either environment, that is, to
determine response as $X_1$ is varied from 0 to 1.25 in the steps indicated.

The intent was to map any possible sub-liminal effects occurring below the
threshold of sensation. Assuming the true relationship between environment,
cuing, and response is linear, then the failure of the observed values to
lie on the straight line is a function of experimental errors. If the
differences are also the result of an inadequate model, then a higher order
model would have to be formulated. Assuming the linear model adequate, the
least squares estimates of the parameters, the respective 95% confidence bounds
for $B_j$, and estimates of standard error are:

This provides the model:


$\hat{Y}_t = 54.6 + 2.5 X_{1t} - 24.3 X_{2t} + 17.0 X_{1t} X_{2t}$


Testing for linearity, the sum of squared error and respective d.f. (lack of
fit) for the variation of $Y_t$ from a straight line are 28.7 and df = 2
respectively. If the model is correct, the residual mean square has the
expected value of $\sigma_y^2$. Using $S^2 = \hat{\sigma}_e^2 = 31.78 = MS_E$, the "F" ratio is:

$F = \dfrac{MS_L}{MS_E} = \dfrac{14.36}{31.78} = 0.45$

and is not significant since it is less than unity. Thus, on the basis of
this test at least, we have no reason to doubt the adequacy of the model and
one can use $S^2 = 31.78$ as an estimate of $\sigma^2$. Further, in examining the
residuals, $(Y_t - \hat{Y}_t)$, and plotting them against $\hat{Y}_t$, one can see that no
abnormality is indicated, that is, (a) $E_t \sim N(0, \sigma_e^2)$, (b) the variance is
fairly constant and there is no need for weighted least squares or
transformations on the $Y_t$, and (c) the model appears adequate. See figure (1).
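
The lack-of-fit arithmetic can be reproduced directly from the two mean squares quoted in the text. The sketch below uses only those quoted values; the function name `lack_of_fit_f` is hypothetical, and the implied lack-of-fit sum of squares is computed from the quoted mean square and its 2 degrees of freedom rather than taken from the scan.

```python
def lack_of_fit_f(ms_lack_of_fit, ms_pure_error):
    """Ratio of the lack-of-fit mean square to the pure-error mean square."""
    return ms_lack_of_fit / ms_pure_error


ms_l = 14.36   # lack-of-fit mean square quoted in the text (2 d.f.)
ms_e = 31.78   # error mean square S^2 quoted in the text

f_ratio = lack_of_fit_f(ms_l, ms_e)
implied_ss_l = ms_l * 2          # sum of squares implied by MS_L and its 2 d.f.
model_adequate = f_ratio < 1.0   # a ratio below unity cannot be significant

print(round(f_ratio, 2), round(implied_ss_l, 2), model_adequate)
```

A ratio below one means the lack-of-fit variation is smaller than the error variation, so no table lookup is needed to conclude that the linear model is not contradicted.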


Figure  1 - Residuals 

Therefore, a valid conclusion is that this linear two-dimensional model
adequately describes response as a function of the two independent variables,
$X_{1t}$ and $X_{2t}$. This technique is presented to show the feasibility of simple
least squares regression in dealing with this type of man-machine interface
problem.

A more sophisticated modeling approach will be attempted at a later time
when more data becomes available.

REFERENCES 

1. J. M. McCray et al., "Cutaneous Communications". US Army Electronics Command
Technical Report # ECOM-3303, Ft. Monmouth, N.J., June 1970.

2. Bennett, H. S., et al., "The Role of Skin Impedance in Cutaneous
Communications". Proceedings of the 19th Conference on Engineering in
Medicine and Biology, San Francisco, Calif., Nov. 1966.

3. Bennett, H. S., and J. R. Hennessy, "The Characteristics of the Cutaneous
Communications Channel". IEEE International Conference on Communications,
Minneapolis, Minn., June 1967.


MULTIVARIATE  DATA  ANALYSIS 


Herbert  Solomon 
Stanford  University 


1.  Introduction 

There has always been a need to achieve parsimonious yet
operationally meaningful accounts of what is going on in nature and in human
behavior. We are aware of attempts by biologists to classify flora and
fauna, and even that dichotomy was a major step forward. It is in the
physical and life sciences that we find the first quantifiers at work
on such matters. Later we find social anthropologists and psychologists
engaging in studies on how groupings can be accomplished. Today we find
numerical taxonomy pervasive in practically every field of study. This
has been spurred by increased activity in data collection and
developments in computer technology. Multiple measurements on elements,
individuals, or variables abound nowadays, and one sees investigators
scurrying about to apply discriminant analysis, classification or
clustering techniques, multidimensional contingency table analysis, factor
analysis, and with good reason. We will return to these topics.

Even though we regard classification in social sciences as rather
new, it is difficult to think of its counterpart in physical sciences as
very old unless one thinks of a few hundred years in the course of mankind
as a very long step. It was just two or three hundred years ago that
many physical ailments were labeled "consumption", because they were
characterized by a "wasting away of the tissues". Under this were
lumped such diseases as leprosy, tuberculosis, diabetes, and others.
It was not until some time later that someone noted that the urine of
some of these sufferers was sweet and that of others was not. Of course,
the subsequent discoveries of two different bacilli for leprosy and
tuberculosis suggested finer groupings that obviously were more
meaningful in connection with specific treatments.

There is a lesson here for all of us, namely that the classification
and grouping of individuals or elements based on data analyses of sets of
variables can lead to man-made group concoctions that are artificial and
sometimes misleading. What should be kept in mind is that when this is
done, a grouping has some meaning to the investigator. For the last
forty years or so, aberrant mental behavior has been subjected to
classification and groupings produced on the basis of observations made on any
number of variables. For an individual placed in one of these groupings,
some treatment is suggested. I imagine one does not feel as comfortable
here in a diagnosis as in the case of diabetes or tuberculosis groupings
at present; and rightfully so. Yet treatment will be undertaken based
on a diagnostic category to which an individual is assigned. This should
give us pause when classification is attempted by data analysis in the
newer investigations such as those that occur, for example, in the
reenlistment decision in the armed services.

2.  History 

It is in the late 19th century that we find a blossoming of
inquiries into classification through the selection and appropriate use
of manifest variables. Quite often a one-dimensional index that
incorporates all pertinent variables was sought so that a technician
could assign an individual to one of several groups based on his responses
to the variables employed. For example, the coefficient of racial
likeness was an index developed at the turn of the century to distinguish
different national or tribal groups on the basis of a set of physical
measurements. Inquiries on association of criminal types with physical
measurements of individuals also received attention in this period by
such investigators as Lombroso.

Much of this inquiry took place in the British community of scholars.
In a way it might be viewed to have begun, at least in a larger sense,
with Charles Darwin's vast collection of data arising from his travels
around the world. His diaries presented many observations on the animal
kingdom and served as a base for study by many who came later in the
19th century.

It was with these investigators in the last quarter of the 19th
century that we have the beginnings of statistical contributions to
classification. In fact, it is the classification problem that in a
way activated and created statistical inference as an area of scientific
inquiry. The modern discipline we now call statistics was brought about
by the anthropometrists, biologists, and psychologists of that era.

Such initial contributors to modern statistics as Francis Galton and
Karl Pearson stem from that period.

Galton seemed to be perpetually engaged in data analysis. He and
his cousin, Darwin, and others revolved in an age of scientific inquiry
that emphasized empiricism. Pearson, along with others, later attempted
quantification and mathematization from the empirical analyses provided
by their colleagues. Galton, whom we regard as the founder of
regression analysis through his study on relationships between children's
heights and parents' heights, also initiated and developed the notion
of correlation prior to 1885. The correlation coefficient serves as a
basic summarization in multivariate data analysis and consequently in
studies that go into techniques of grouping. From its very nature,
obviously a high correlation coefficient would indicate that the two
variables belong in a group and a low correlation would suggest that
they do not.

In one of his papers in 1886, Galton became interested in the
classification problem. He pointed out that 12 measures proposed by
Bertillon to be used for classification of criminals were not
independent and suggested that the observed measurements be transformed
into a set of independent measures. He also suggested the method of
transformation, which we can now view as simple or unweighted summation
in factor analysis. Thus quite early we see the intermingling of
classification analysis and factor analysis - and of course this is still
quite current. We will return to factor analysis and its place in
classification analysis.

Pearson was engaged in studies that were obviously related to
classification. In an interesting paper in 1901, he discussed mathematical
representations of lines and planes of closest fit to systems of points
in space. This geometrical way of looking at the classification problem
may present a neater view of the problem to some. In effect, the
multidimensional observations at hand, e.g., age, IQ, schooling, number of
dependents, rank, length of enlistment, etc., for each member of a
population of N members up for reenlistment decision can be viewed as
N points in a 7-dimensional space. Moreover, each point cannot be
reached by traveling along 7 perpendicular axes, for the 7 variables can
and usually do have degrees of association which must be taken into account.

This effort is a fundamental problem in multivariate data analysis,
namely finding a grid of orthogonal axes to replace the grid of correlated
axes (naturally the points remain where they are). If the number of
dimensions can be reduced to two or three, ease is achieved since
elements can be grouped by eye. In fact, this is related to one of the
central problems in factor analysis and is pertinent to the use of factor
analysis as a classification technique.


3.  Assignment  Procedures  and  Discriminant  Analysis 

It is now important to be specific about the term "classification".
For our purposes, we will assume that the term comprises both the
clustering of data into groups and the assignment of data to previously
specified groups. Actually, the latter can be viewed as a subset of
the former. In the former category, we require the data to produce
both the number of groupings or clusters and the assignment of each
element or individual to these groupings. In the latter category, the
number of groups or clusters is predetermined. Each group is labeled,
and rules are designed on the basis of which an assignment of each
element is made to one of the fixed groups.

We do not wish to convey a sharp distinction between clustering and
assignment procedures. If a classification procedure is not producing
meaningful groups through the assignments that are made, then changes
are called for, namely revising the predetermined groupings either in
number or in shape or in both on the basis of the new information.
This sequential revision of groups on the basis of the data available
at different times suggests that one is indirectly engaging in
clustering procedures. On the other hand, it is wise to keep in mind
the conceptual differences just mentioned between attempts at clustering
and attempts at assignment.

An essential step in classification procedures is the representation
of the relationships among the variables on which data have been collected.
Among other important and prior steps, there are the processes of
developing numbers to measure phenomena, making decisions on the
employment of nominal, ordinal or continuous data, and subsequent coding of
these data for analysis. In this paper, we do not review these issues,
but we are mindful of their impact on the data analysis that will undergo
investigation. Thus, we return quickly to clustering and assignment
techniques and the basic summarizations of data for these purposes.

The clustering and assignment problems, even though they were
recognized for some time, did not possess any techniques until rather
recently. The assignment problem received the first thrust. The analysis
was provided by one of the great savants of modern statistical inference,
namely R. A. Fisher. In a paper in 1936, we find what is now Fisher's
classic work on discriminant analysis. It is entitled "The Use of Multiple
Measurements in Taxonomic Problems" and was published in The Annals of
Eugenics. The author was to say somewhat later that the paper was written
to embody the working of a practical numerical example arising in plant
taxonomy in which the concept of a discriminant function seems to be of
immediate service. This is a simple but fascinating statement, because
it demonstrates once again that when there is a problem requiring
solution some strides can be made. Too often we find solutions looking
for a problem, and this is something we should be especially concerned
with in classification problems.

In his paper, Fisher also listed the basic data he analyzed. This
is rarely done by authors, and so we find the Fisher data and just a
few other data bases referred to time and time again by subsequent
authors who are experimenting with new assignment or clustering
techniques. In this way, an anchor is provided against which the results
of other techniques can be assessed.

The data employed by Fisher were supplied by a botanist, and they
represented measurements on the irises of the Gaspé Peninsula. These data
were previously published in the Bulletin of the American Iris Society and
were therefore not a likely contender for a best seller. Since it is a
classical piece in the statistical literature, let us look at it in some
detail. Four measurements on each of fifty plants in each of three iris
categories were obtained. The categories are: Iris Virginica, Iris
Versicolor, and Iris Setosa. For each of the 150 plants already assigned
to one of three categories, there are measurements of sepal length, sepal
breadth, petal length, and petal breadth.

If we refer back to our geometrical representation, we have 150
points scattered in a four-dimensional space, except that each point is
already labeled as belonging to one of three groups. The question is
whether in some neat and simple way we can separate the 50 points belonging
to any one group from the other two sets. This is compounded by the fact,
in this case, that two of the irises, namely Versicolor and Virginica,
actually have a specific genetic relationship and obviously, then, do
have some overlap. In other words, Fisher is looking for hyperplanes
that partition the four-dimensional space, and after partitioning,
hopefully leave each group inviolate. Algebraically, he is asking for
a linear function of the four measurements (later called the discriminant
function) that accomplishes this. As a reasonable index for determining
the coefficients of the linear function, he suggests one that will
maximize the ratio of the difference between the means to the standard
deviations within species. To be specific, let $d_p$, $p = 1, 2, 3, 4$,
represent the differences in the observed means.

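Then for any linear function, X, of the measurements, namely

$$X = \lambda_1 x_1 + \lambda_2 x_2 + \lambda_3 x_3 + \lambda_4 x_4,$$

the difference between the means of X in the two species is

$$D = \lambda_1 d_1 + \lambda_2 d_2 + \lambda_3 d_3 + \lambda_4 d_4,$$

while the variance of X within species is proportional to

$$S = \sum_{p=1}^{4} \sum_{q=1}^{4} \lambda_p \lambda_q S_{pq},$$

where $S_{pq}$ is the sum of squares or products in $x_p$ and $x_q$.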

The particular linear function that best discriminates the two
species will be one for which the ratio $D^2/S$ is greatest, by variation
of the four coefficients $\lambda_1, \lambda_2, \lambda_3, \lambda_4$.
Geometrically we are locating the hyperplane that best separates two
groups of points in the sense that the distance between the
four-dimensional centroids is greatest. Though there are three groups of
irises, in effect Fisher acts as if there are two groups, since Iris
Versicolor and Iris Virginica are genetically tied together. Note that
the variation within species is assumed to be the same in this development.
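
The maximization of $D^2/S$ can be illustrated numerically. The following is a minimal sketch, not Fisher's own computation: it assumes only two measurement variables so that the 2 x 2 inverse can be written out by hand, it uses made-up toy measurements rather than the iris data, and it relies on the standard result that the maximizing coefficients are proportional to the solution of $S\lambda = d$. All function names are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)


def pooled_within(group_a, group_b):
    """S[p][q]: sums of squares and products about each group's own means."""
    S = [[0.0, 0.0], [0.0, 0.0]]
    for group in (group_a, group_b):
        m = [mean([row[p] for row in group]) for p in range(2)]
        for row in group:
            for p in range(2):
                for q in range(2):
                    S[p][q] += (row[p] - m[p]) * (row[q] - m[q])
    return S


def mean_differences(group_a, group_b):
    """d[p]: difference in the observed means of variable p."""
    return [mean([r[p] for r in group_a]) - mean([r[p] for r in group_b])
            for p in range(2)]


def discriminant_coefficients(d, S):
    """Solve S * lam = d for two variables (closed-form 2 x 2 inverse)."""
    det = S[0][0] * S[1][1] - S[0][1] * S[1][0]
    return [(S[1][1] * d[0] - S[0][1] * d[1]) / det,
            (S[0][0] * d[1] - S[1][0] * d[0]) / det]


def ratio(lam, d, S):
    """The index D^2 / S for a given coefficient vector lam."""
    D = sum(lam[p] * d[p] for p in range(2))
    within = sum(lam[p] * lam[q] * S[p][q]
                 for p in range(2) for q in range(2))
    return D * D / within


# Toy measurements (hypothetical, two variables per plant).
species_a = [(1.0, 2.0), (2.0, 3.5), (3.0, 3.0), (2.5, 4.5)]
species_b = [(4.0, 1.0), (5.0, 2.5), (6.0, 2.0), (5.5, 3.0)]
d = mean_differences(species_a, species_b)
S = pooled_within(species_a, species_b)
lam = discriminant_coefficients(d, S)
```

Any other choice of coefficients, such as using one measurement alone, gives a ratio no larger than `ratio(lam, d, S)`; rescaling `lam` by a constant leaves the ratio unchanged.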

The index that is employed to provide the delineation is tied at
first to the multivariate normal structure assumed for each species.
Yet it is very similar to the indexes suggested by strict multivariate
data analysis as we will see in the next section. Here we are maximizing
the difference between the centroids of the two species of irises, or,
in other words, maximizing heterogeneity between groups. This theme
will carry through all of our attempts at classification. Either we
will maximize heterogeneity between groups or minimize the scatter
(i.e., seek homogeneity) within groups.

As a result of the analysis, Fisher arrives at a linear discriminant
function that accomplishes a nice separation. For example, Iris Setosa
is separated completely from Versicolor and Virginica. It turns out
that only one of the four measurements is really necessary to do this,
namely petal length, and this can probably be seen by just looking at
the 150 sets of measurements. This should be something for us to highlight,
especially when we get into data sets for which meanings are not so
specific and measurements are not so commensurate. This will obviously
be so in any number of studies in criminal justice.

Fisher's work has been extended to assign an element to any one of
k groups, and computer programs exist in Computer Center libraries to
accomplish multiple linear discriminant analysis. Attached to this
subject is the question of how many variables should be used in a
discriminant function. It is obvious that the more variables one uses,
the better the discrimination should be, but it is also obvious that the
marginal gain in using additional variables can decrease sharply
and therefore some variables can best be omitted in the interests of
parsimony. Thus we seek the best discriminating variables.

We might also ask what one would do if one were faced with the 150
irises and did not know their groupings; that is, if we had only the
four measurements on each, and we wished to see what number of groupings
as well as assignments could be made. Here we are no longer faced with
the assignment problem alone, but with the clustering problem or
grouping problem, which of course subsumes an assignment problem. It is
to this topic that we now turn.

4.  Data  Summarization 

It is important in talking about grouping to consider whether we
are grouping measurement variables or individuals or elements of a
population. For the iris data, we are grouping elements of a population.
Quite often, one is interested in grouping measurement or test variables.
The basic data summarization in multivariate data analysis will depend
on whether we are grouping variables or elements. We will resolve this
in subsequent discussion by first going in some detail into the data
summarization question.

There are several ways to begin the data summarization. All give
a picture of data interrelationship, but each has special reasons for
its employment by an investigator. One representation is that of the
scatter matrix. Here we portray the total scatter or dispersion
displayed by n individuals or elements each measured on p variables
(n points in a p-dimensional space) by a matrix with p rows and p
columns where an element in the $i$th row and $j$th column, say $t_{ij}$, is
the sum of the n cross products of measurements (taken around the mean)
on variable $x_i$ with measurements (taken around the mean) on variable
$x_j$. In brief,

$$t_{ij} = \sum_{k=1}^{n} (x_{ik} - \bar{x}_i)(x_{jk} - \bar{x}_j), \qquad t_{ij} = t_{ji}, \qquad \bar{x}_i = \frac{1}{n} \sum_{k=1}^{n} x_{ik}.$$


Let us label this matrix T. Naturally an element in the main diagonal,
say $i$th row and $i$th column, is the sum of the squares of the deviations
of $x_i$ from its mean. If p = 1, then T is a scalar, namely

$$\sum_{k=1}^{n} (x_k - C)^2,$$

where $C$ is the mean of the $x_k$.


If each element in the scatter matrix T is divided by n, the
resulting matrix is the covariance matrix with cell entries $s_{ij}$ and
we label this K. Now if we also divide each element, $s_{ij}$, in K
by the standard deviations of $x_i$ and $x_j$, the resulting element
$r_{ij}$ is the correlation coefficient between $x_i$ and $x_j$ and
the resulting matrix is now the correlation matrix which we label R.

An important advantage of T is the manner in which it can be
decomposed into two matrices that are especially pertinent in clustering
and classification studies. In a classification study, the n elements
will be assigned to k predetermined groups. Each group with, say,
$n_i$ elements can be viewed as a universe with its own scatter matrix
formed as before and labeled $W_i$. If we sum all the $W_i$ scatter
matrices, we get $W = \sum_{i=1}^{k} W_i$ and let this represent the within
scatter or homogeneity of the groupings. Likewise, for each of the k
groups, we compute the group mean (a p-dimensional vector where the
$r$th coordinate is the mean value based on the observations for
$x_r$) and then produce the (p × p) scatter matrix of these group means
about the overall mean, which we label B, for it
expresses a measure of the "betweenness" or heterogeneity of the k
groups. The central point in this development is the existence of the
fundamental matrix equation

$$T = W + B.$$

This result suggests immediately an index by which classification
(predetermined number of groups) can be evaluated and, by extension,
how clustering can be terminated at some cluster size. For any given
data set T is fixed. Thus measures of "groupiness" or "clusteriness"
as functions of W and B are thrust forth for examination.
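
The decomposition can be checked numerically on a small data set. The sketch below is illustrative only: the function names and the toy points are hypothetical, and it assumes that B weights each group mean by its group size $n_i$, which is the weighting under which T = W + B holds exactly.

```python
def mean_vector(points):
    n, p = len(points), len(points[0])
    return [sum(pt[r] for pt in points) / n for r in range(p)]


def scatter(points):
    """Sums of cross products of deviations about the points' own mean."""
    c = mean_vector(points)
    p = len(c)
    return [[sum((pt[i] - c[i]) * (pt[j] - c[j]) for pt in points)
             for j in range(p)] for i in range(p)]


def decompose(groups):
    """Return (T, W, B) for a list of groups of p-dimensional points."""
    everything = [pt for g in groups for pt in g]
    grand = mean_vector(everything)
    p = len(grand)
    T = scatter(everything)
    W = [[0.0] * p for _ in range(p)]
    B = [[0.0] * p for _ in range(p)]
    for g in groups:
        Wi = scatter(g)
        m = mean_vector(g)
        for i in range(p):
            for j in range(p):
                W[i][j] += Wi[i][j]
                # Assumption: each group mean is weighted by its size n_i.
                B[i][j] += len(g) * (m[i] - grand[i]) * (m[j] - grand[j])
    return T, W, B


# Toy data: two groups of points in two dimensions (hypothetical values).
groups = [[(0.0, 0.0), (0.0, 1.0), (1.0, 0.0)],
          [(10.0, 10.0), (10.0, 11.0), (11.0, 10.0), (12.0, 12.0)]]
T, W, B = decompose(groups)
```

Because T does not depend on the grouping, moving a point from one group to another changes W and B by equal and opposite amounts.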

For p = 1, the matrix equation reduces to an equation about scalars.
Thus a good grouping index is one which minimizes W or equivalently
maximizes B. We may also consider maximizing either the ratio B/W
or T/W = 1 + B/W. An added benefit is that this ratio is invariant
under linear transformations of the data. Statisticians have long
exploited this fact, for B/W multiplied by an appropriate constant is
the familiar F ratio in the analysis of variance.
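
For the scalar case the link to the analysis of variance can be written out explicitly. This is a sketch under the usual one-way layout with n elements in k groups; the degrees-of-freedom constants are the standard ones and are not stated in the text above.

```latex
T = W + B, \qquad
F = \frac{B/(k-1)}{W/(n-k)} = \frac{n-k}{k-1}\cdot\frac{B}{W}, \qquad
\frac{T}{W} = 1 + \frac{k-1}{n-k}\,F .
```

For fixed n and k, maximizing B/W, maximizing T/W, and maximizing F therefore select the same grouping.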

When the number of measurements per element is two or more (p > 1),
grouping criteria are not so straightforward. Several possibilities
suggest themselves and have been developed and studied by investigators.
One criterion suggested by several authors that is a quite natural
index is the minimization of the trace of W (sum of all elements in
the main diagonal of the matrix) over all possible partitions into k
groups. This is equivalent to maximizing Trace B because

$$\mathrm{Trace}\; T = \mathrm{Trace}\; W + \mathrm{Trace}\; B.$$

However, Trace W is invariant only under an orthogonal transformation
and not under non-singular linear transformations.

Another criterion that may be employed for p > 1 is the ratio of
the determinants

$$|T|/|W| = |I + W^{-1}B|.$$

We can use $|T|/|W|$ as a criterion for grouping and select that
grouping for which this index is maximized, or equivalently $|W|$ is
minimized. Also we may employ $\log(|T|/|W|)$ since it is a monotonic
function.

Another criterion for grouping is the trace of $W^{-1}B$ and we select
the grouping that maximizes this index. This index has been used as
a test statistic in multivariate statistical analysis as has the ratio
$|W|/|T|$. The latter was employed by Wilks to test whether groups
differ in mean values, and the former has been put forth by Hotelling
in some situations and by Rao as a generalization of the Mahalanobis
distance between two groups for k > 2 groups. We will shortly define
and discuss the implications and uses of the Mahalanobis distance in
clustering procedures.

Both Trace($W^{-1}B$) and $|T|/|W|$ may be expressed in terms of the
eigenvalues, $\lambda_i$, of the matrix $W^{-1}B$. We write

$$|T|/|W| = \prod_{i=1}^{p} (1 + \lambda_i)$$

and

$$\mathrm{Trace}\; W^{-1}B = \sum_{i=1}^{p} \lambda_i,$$

where the $\lambda_i$ are the roots of the determinantal equation $|B - \lambda W| = 0$.

The characterization of these ratios in terms of eigenvalues is helpful
in data representation, especially when some reduction
in dimensionality is desired. All the eigenvalues of this equation are
invariant under non-singular linear transformations of the data. It
can be proved that these eigenvalues are the only invariants of W and
B under non-singular linear transformations.
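
Both eigenvalue identities can be verified by hand for p = 2, where the determinantal equation is a quadratic in $\lambda$. The sketch below is illustrative: the matrices W and B are hypothetical symmetric examples (W positive definite), and the function names are not from any library.

```python
from math import sqrt


def det2(M):
    return M[0][0] * M[1][1] - M[0][1] * M[1][0]


def roots_of_determinantal_equation(B, W):
    """Roots of |B - lam*W| = 0 for symmetric 2 x 2 matrices.

    Expanding the determinant gives
        det(W)*lam^2 - c*lam + det(B) = 0,
    with c as computed below.
    """
    a = det2(W)
    c = (B[0][0] * W[1][1] + B[1][1] * W[0][0]
         - B[0][1] * W[1][0] - B[1][0] * W[0][1])
    disc = max(c * c - 4.0 * a * det2(B), 0.0)
    return [(c + sqrt(disc)) / (2.0 * a), (c - sqrt(disc)) / (2.0 * a)]


def trace_w_inverse_b(B, W):
    """Trace of W^{-1} B, using the closed-form 2 x 2 inverse of W."""
    d = det2(W)
    inv = [[W[1][1] / d, -W[0][1] / d], [-W[1][0] / d, W[0][0] / d]]
    return sum(inv[i][j] * B[j][i] for i in range(2) for j in range(2))


# Hypothetical within and between scatter matrices.
W = [[4.0, 1.0], [1.0, 3.0]]
B = [[2.0, 1.0], [1.0, 5.0]]
T = [[W[i][j] + B[i][j] for j in range(2)] for i in range(2)]
lam1, lam2 = roots_of_determinantal_equation(B, W)
determinant_ratio = det2(T) / det2(W)
```

For these matrices `determinant_ratio` equals `(1 + lam1) * (1 + lam2)` and `trace_w_inverse_b(B, W)` equals `lam1 + lam2`, up to rounding.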

5.  Distance  Matrix

Thus far we have discussed some summarizations of multivariate data
in matrix form, either T (scatter), K (covariance), or R (correlation),
and the kinds of grouping criteria that are suggested by the T
format. Intuitively, we see that any grouping criterion is a function
of homogeneity within groups and heterogeneity between groups and the
indexes already described are specific quantities embodying these notions.
We shall discuss other indexes as we proceed, but each will be a function
of homogeneity within groups and heterogeneity between groups in which
attempts will be made to minimize the scatter within groups, maximize the
heterogeneity between groups, or in effect do both. For the correlation
coefficient index, large values indicate homogeneity; small values
indicate heterogeneity.

Another method of summarizing data that is more appropriate on
occasion is to find the distance between each pair of the n points
in the p-dimensional space. This leads to a representation in matrix
form of an n × n matrix where each element, in the $i$th row and the $j$th
column, say $d_{ij}$, is the distance in the p-dimensional space between
the $i$th element or individual and the $j$th element or individual. All
the elements in the main diagonal are zero. The distance matrix is akin
to the correlation matrix in that both may be viewed as similarity
matrices, the jumping-off place for clustering attempts.

The decision as to whether correlation matrices or distance matrices
are to be employed is usually determined by the problem at hand. If
n individuals or n elements are to be grouped on the basis of p
measurements on each, then the n × n distance matrix is the natural
summarization; if the p measurement variables are to be grouped on the
basis of the measurements on n individuals or n elements, then the
p × p correlation matrix is the natural summarization of the data.

This latter matrix is the natural beginning point in factor analysis
where parsimony in the number of latent measurement variables is a
desired goal. We will return to factor analysis and its place in clustering
in subsequent sections. In some taxonomic situations the question of which
measure of similarity to employ, whether it is of the association or
distance type, will require some thought. While we will touch on these
points, these inquiries will not be featured in this exposition.

The notion of a distance matrix will be placed in sharper focus, and
this will be done by some discussion of appropriate distance measures.
Because we will normally think of our data bases for clustering individuals
or elements as n points in a p-dimensional space, the distance measures
usually appropriate and available are Euclidean distance and Mahalanobis
distance. The Euclidean distance between individuals or elements with
respect to all p measurement variables may be written in vector
notation

$$d_{ij}^2 = (P_i - P_j)'(P_i - P_j),$$

where $d_{ij}$ is the Euclidean distance between individual i and
individual j, and $P_i$ and $P_j$ are column vectors each with p rows
listing the p measurements on the $i$th and $j$th individuals respectively.
The product of the difference row vector $(P_i - P_j)'$ and its transpose is
a scalar. This is the distance function with which most of us are
familiar. The Mahalanobis distance may be written in the notation
above as

$$D_{ij}^2 = (P_i - P_j)' W^{-1} (P_i - P_j),$$

where $W^{-1}$ is the inverse matrix of $W = \sum_{i=1}^{k} W_i$ and $W_i$ is obtained
for each of the i = 1, 2, ..., k groups by

$$W_i = \sum_{j=1}^{n_i} (P_j - C_i)(P_j - C_i)',$$

the sum running over the $n_i$ elements of group i, with $C_i$ the center
of gravity (mean vector) of that group.

Note that a grouping of elements is necessary to compute $W_i$ and
consequently W. Thus the Mahalanobis distance takes into account the
associations or interrelationships in the measurement variables. If
two measurement variables are highly correlated, the Euclidean distance
can be misleading because of the equal weight it imposes inaccurately on
each measurement variable, but this will not be so with the Mahalanobis
distance. The Mahalanobis distance is more tedious to compute and for
a long time it was avoided for this reason alone, but the computer has
brought it within reach. Actually if each of the correlations between
the measurement variables is low, the error in employing the Euclidean
distance is not damaging. As a rule of thumb, correlations as high as
0.5 will not produce Euclidean distances that lead to operational
difficulties.
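
The contrast between the two distances is easiest to see with two strongly correlated variables. The following sketch is limited to p = 2 so that the inverse of W can be written in closed form; the matrix W and the points are hypothetical illustrations, and the functions return squared distances.

```python
def squared_euclidean(p_i, p_j):
    return sum((a - b) ** 2 for a, b in zip(p_i, p_j))


def squared_mahalanobis(p_i, p_j, W):
    """(P_i - P_j)' W^{-1} (P_i - P_j) for two measurement variables."""
    det = W[0][0] * W[1][1] - W[0][1] * W[1][0]
    inv = [[W[1][1] / det, -W[0][1] / det],
           [-W[1][0] / det, W[0][0] / det]]
    diff = [p_i[0] - p_j[0], p_i[1] - p_j[1]]
    return sum(diff[r] * inv[r][s] * diff[s]
               for r in range(2) for s in range(2))


# Hypothetical within scatter with strongly associated variables.
W = [[1.0, 0.9], [0.9, 1.0]]
origin = (0.0, 0.0)
along = (1.0, 1.0)    # displacement that follows the association
across = (1.0, -1.0)  # displacement that runs against it
```

Both displacements have the same squared Euclidean length, 2, but `squared_mahalanobis` rates the `across` point as far more distant than the `along` point, because a departure against the prevailing association is the more unusual one. With W equal to the identity the two measures coincide.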

Other distance measures appear in the literature. The Minkowski
distance is the name applied to all distance measures that are of the
form

$$d_{ij} = \left[ \sum_{r=1}^{p} |x_{ri} - x_{rj}|^{n} \right]^{1/n}.$$

We have discussed the case n = 2. When n = 1, the label "city-block"
distance is sometimes employed and it may be relevant for some distance
situations.
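
A minimal sketch of the Minkowski family follows; the function name is hypothetical, and the exponent is called `n` only to match the notation of the text (it is unrelated to the number of individuals).

```python
def minkowski(p_i, p_j, n):
    """Minkowski distance of order n between two measurement vectors."""
    return sum(abs(a - b) ** n for a, b in zip(p_i, p_j)) ** (1.0 / n)


euclidean = minkowski((0.0, 0.0), (3.0, 4.0), 2)   # the case n = 2
city_block = minkowski((0.0, 0.0), (3.0, 4.0), 1)  # the case n = 1
```

For the two points shown, the Euclidean distance is 5 and the city-block distance is 7.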

6.  Clustering

We now look at the clustering side of classification analysis. Our
main emphasis will be on clustering as an exploratory device. Development
of assignment procedures is for those who already enjoy the luxury of
knowing the groups that exist. We will place ourselves in the situation
where a body of multidimensional data has been collected by some investigator
and he wishes to decipher what kind of structure, if any, underlies the data
collected. A wide variety of techniques have been suggested and attempted.
They run the gamut from looking at all possible partitionings of the data to
trying to zero in on an optimal partitioning without having to look at too
much of all the possibilities. The former method is a "dumb" procedure which
is workable if the computer can quickly look at everything, and of course
this is not so even for a small number of observations in a small number
of dimensions. Thus we sacrifice optimal partitioning for what we hope
are suboptimal partitions that can be achieved much more cheaply.

Let us consider one general way of looking at the problem considered
by several authors. We start with any given partition into g groups.
Consider moving a single object into every group other than the one it is
in. If no move will create a partition for which the clustering criterion
is increased, leave the object where it is. Otherwise, move it so that
the maximum increase in the criterion occurs. Naturally, we are assuming
the existence of a reasonable criterion. Using the partition thus created,
we process the second object in the same way, then the third, etc. After
several passes, one will reach a point at which no move of a single object
from the group it is in to a different group will cause an increase in the
criterion function. At this point we say we have found a "local maximum"
of our criterion function. This rarely takes more than a reasonable time
on a computer. This has been labeled the "hill-climbing" pass algorithm
by Friedman and Rubin.
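
The hill-climbing pass just described can be sketched in a few lines. This is an illustration under stated assumptions, not Friedman and Rubin's program: the criterion is taken to be the negative of Trace W (so that an increase in the criterion is a decrease in within-group scatter), an object is never moved if it is the last member of its group, and the names `hill_climb`, `criterion`, and `max_passes` are hypothetical.

```python
def trace_w(points, labels, g):
    """Trace of the pooled within-group scatter matrix W."""
    total = 0.0
    for group in range(g):
        members = [pt for pt, lab in zip(points, labels) if lab == group]
        if not members:
            continue
        p = len(members[0])
        center = [sum(m[r] for m in members) / len(members) for r in range(p)]
        total += sum((m[r] - center[r]) ** 2 for m in members for r in range(p))
    return total


def criterion(points, labels, g):
    # Assumption: larger is better, so use the negative of Trace W.
    return -trace_w(points, labels, g)


def hill_climb(points, labels, g, max_passes=100):
    """Repeat single-object moves until no move increases the criterion."""
    labels = list(labels)
    for _ in range(max_passes):
        moved = False
        for idx in range(len(points)):
            home = labels[idx]
            if labels.count(home) == 1:
                continue  # assumption: never empty a group
            best_group, best_value = home, criterion(points, labels, g)
            for group in range(g):
                if group == home:
                    continue
                labels[idx] = group  # try the move
                value = criterion(points, labels, g)
                if value > best_value + 1e-12:
                    best_group, best_value = group, value
            labels[idx] = best_group  # keep the best move, or stay home
            moved = moved or best_group != home
        if not moved:
            break  # a local maximum has been reached
    return labels


# Toy data: two well-separated clumps with a deliberately poor start.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
start = [0, 1, 0, 1, 0, 1]
result = hill_climb(points, start, 2)
```

On this toy data the pass sequence recovers the two clumps. In general the result is only a local maximum and depends on the starting partition and on the order in which objects are processed.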

They and others have suggested modifications. For example, we start
with the best partition yet known. Then process one group at a time, in
sequence, by placing each object of the group being processed into the
outside group with nearest center of gravity, recalculating the criterion
function after each move. This is done in order, the object nearest an
outside group being moved first. Although the criterion initially
decreases, it may at some point during the process achieve a value higher
than previously found. This will especially be the case if the group
being processed consists of two clusters widely separated in space. After
processing all the objects of one group, we restore the best partition yet
found, and proceed to process the next group. This has been labeled a
"forcing pass" algorithm. It is defined as the application of this
procedure once to each group, in sequence. Forcing passes are repeated
until they produce no improvement. These passes are relatively fast,
compared to hill-climbing, since we need not evaluate every possible move
for an object.

Still another procedure proposed by Friedman and Rubin and others
involves starting with a partition Q (we use the best partition currently
known) and reassigning each object to the group with nearest center of
gravity. The value of the newly formed partition is then calculated.
With either of the other two criteria just discussed, we use the metric
defined by the matrix $W^{-1}$ computed from the partition Q, i.e.,
$d(P, C_k) = (P - C_k) W^{-1} (P - C_k)^T$ (with P and $C_k$ written here
as row vectors). The centers of gravity $C_k$ and the scatter
matrix W are maintained as those of the original partition Q until all
n objects have been reassigned, at which time new values for $C_k$ and W
are computed. This contrasts with hill-climbing, for which the partition
and the derived W change with each move of an object.

The reassignment of each object in the above manner is termed a
"reassignment pass". Reassignment passes are repeated until a partition
with higher value is no longer achieved. Sets of forcing passes and
reassignment passes are alternated until neither produces improvement, and
then hill-climbing is resorted to for a new local maximum. Other
modifications are also applied, but when it proves impossible to reach a
higher local maximum, the procedure is terminated. If one is willing and
financially able to spend the computer time, one can repeat the entire
procedure using another starting partition chosen at random or, as we
will soon see, obtained by a quick step-wise method. The forcing and
reassignment passes are fast, but only occasionally helpful. Restarting
from each of several random partitions or the step-wise solution is slow
but provides more confidence in the result.

7.  Initial  Partitioning 

There is a much simpler way of initiating clustering. It was proposed
by King and in effect gives a quick initial partitioning of the data
whether it be measurement variable groupings or delineation of individuals
in a population. Either something of interest and use to the investigator
appears quickly, or what does emerge can serve as the first step for those
algorithms that require a start upon which various kinds of iterations are
attempted. These were just described in the previous section.

The procedure proposed by King is a step-wise clustering procedure.
This is its principal asset because it leads to a simple and quick
algorithm that involves (n - 1) scannings of a correlation matrix based
on n variables. At each scanning or pass, the variables are sorted into
a number of groups that is one less than at the previous pass. In this
way, we obtain (n - k) groups of variables at the $k$th scanning. The
(n × n) matrix can also be a distance matrix. In that case, we sort
individuals or elements into groups.

The procedure operates as follows. We will employ the correlation
matrix as our similarity matrix for expository purposes, and bring in the
distance matrix when appropriate to highlight differences.

As a start, we can view the n variables as n groups, one variable
to each group. Now scan the correlation matrix for the maximum cell entry
(naturally without regard to sign). In a distance matrix we would seek
the minimum distance cell entry. Suppose the maximum correlation is between
variables $X_i$ and $X_j$. Label it $r_{ij}$. We place $X_i$ and $X_j$ in the
same group, and we now have (n - 1) groups
$X_1, X_2, \ldots, (X_i, X_j), \ldots, X_{n-1}, X_n$.
This produces an (n - 1) × (n - 1) correlation matrix, all pairs of
correlation coefficients over the original (n - 2) variables plus the
correlations obtained by pairing each of these with the concocted variable
$X_i + X_j$. Essentially, we are representing the group of two elements
by its centroid.

On the second pass of what is now an (n - 1) × (n - 1) correlation
matrix, a third variable may join the group of two variables formed on the
first pass if the correlation between it and $X_i + X_j$ is maximum, or the
maximum correlation value in the reduced correlation matrix may again
involve two individual variables. Thus we would get either one group of
three variables and (n - 3) groups each containing one variable, or two
groups each containing two variables and (n - 4) groups each containing
one variable. In either situation we merge variables and revise the
correlation matrix as on the first pass. In the former case, the centroid
of the group of three variables represents its group, and in the latter
case, each group with two variables is represented by its centroid.

Recall  that  we  do  not  have  to  divide  the  aum  of  the  variables  by  the  numbar 
of  veriebles  to  obtain  the  centroid  because  the  correlation  coefficient  x a 
invariant  when  one  variable  of  the  pair  is  always  multiplied  by  the  same 
constant. 

Thus, at each pass, the two groups with the highest correlation
are merged and the total number of groups to that point is reduced by
one. After a variable has joined a group of variables, it cannot be
removed from that group. In this way it is possible to miss an optimal
grouping. This is very similar to selection of predictors in step-wise
linear regression. It should also be mentioned that a group can lose its
identity by merging with another group on a later pass. By the time all
the scanning is completed we have produced successively (n - 1), (n - 2),
(n - 3), ..., 3, 2 groupings.
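
The passes described above can be sketched in a few lines of Python. This
is only an illustration under stated assumptions, not King's program: the
input format (a dictionary of variable names and observations), the function
names, and the sample data are all hypothetical. Each group is represented
by the sum of its members, which plays the role of the centroid because the
correlation coefficient does not change when a variable is multiplied by a
constant. The sample data are chosen so that all large correlations are
positive.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def stepwise_cluster(variables):
    """Merge groups pass by pass and return the grouping after each pass.

    `variables` maps a name to its list of observations (assumed format).
    """
    groups = {(name,): list(obs) for name, obs in variables.items()}
    history = []
    while len(groups) > 2:
        keys = list(groups)
        # Scan all pairs of group representatives for the largest |r|.
        a, b = max(
            ((g, h) for i, g in enumerate(keys) for h in keys[i + 1:]),
            key=lambda pair: abs(pearson(groups[pair[0]], groups[pair[1]])),
        )
        merged = [u + v for u, v in zip(groups[a], groups[b])]
        del groups[a], groups[b]
        groups[a + b] = merged  # members of a merged group are never separated
        history.append(sorted(groups))
    return history


# Hypothetical data: x1 and x2 move together, as do x3 and x4.
sample = {
    "x1": [1, 2, 3, 4, 5, 6],
    "x2": [1.1, 1.9, 3.2, 3.9, 5.1, 6.0],
    "x3": [5, 1, 4, 2, 6, 3],
    "x4": [5.2, 1.1, 3.9, 2.2, 5.8, 3.1],
}
history = stepwise_cluster(sample)
```

With four variables the sketch records the groupings into three and then two
groups, matching the (n - 1), ..., 2 sequence in the text.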

The clustering index employed by King for measuring the worth of
the grouping is that of minimum correlation (or maximal distance) between
the group centroids when the scanning has placed the variables into two
groups. This leaves something to be desired because it does not look at
the effectiveness of the grouping when more than two groups are involved.
He also reviews another index, suggested originally by Wilks for testing
the mutual independence of k subsets of n multivariate normal random
variables. In terms of what we described earlier in the paper, the index
is the ratio of the determinants

    $$\frac{|T|}{|T_1|\,|T_2| \cdots |T_k|}$$

where T is the scatter matrix defined previously and each $T_i$ is the
scatter matrix for each of the k groups.

This index has some nice geometrical and statistical properties. For
example, when k = 2, the index equals

    $$\prod_i (1 - r_i^2)$$

where $r_i$ is the ith canonical correlation between the two sets of
variables. This index may be viewed as a "generalized alienation
coefficient" since it is an extension of $1 - R^2$, where R is the multiple
correlation coefficient occurring when two groups have one variable in
one group and (n - 1) in the other. However, it is not too useful in
some data analyses, especially in social science, because a number of
data sets lead to quasi-singular correlation matrices and truncation error
can give ridiculous results. For this reason, and possibly others, negative
determinants appear and make it impossible to employ the Wilks index.
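
A small numerical check may help fix ideas. The sketch below assumes the
index is the determinant of the full correlation matrix divided by the
product of the determinants of the within-group blocks, which is the usual
form of Wilks's independence criterion; the function names and the 3 x 3
correlation matrix are invented for illustration. With one variable in the
first group and two in the second, the index should equal $1 - R^2$.

```python
def det(m):
    """Determinant by cofactor expansion (adequate for small matrices)."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def block(m, idx):
    """Rows and columns of m selected by the index list idx."""
    return [[m[i][j] for j in idx] for i in idx]


def wilks_index(corr, groups):
    """Assumed form: |corr| divided by the product of within-group determinants."""
    denom = 1.0
    for g in groups:
        denom *= det(block(corr, g))
    return det(corr) / denom


# Hypothetical correlation matrix for three variables.
corr = [
    [1.0, 0.6, 0.5],
    [0.6, 1.0, 0.3],
    [0.5, 0.3, 1.0],
]
index = wilks_index(corr, [[0], [1, 2]])

# Squared multiple correlation of variable 0 on variables 1 and 2.
r12, r13, r23 = corr[0][1], corr[0][2], corr[1][2]
r_squared = (r12 ** 2 + r13 ** 2 - 2 * r12 * r13 * r23) / (1 - r23 ** 2)
```

When the correlation matrix is nearly singular the determinants in this
ratio are close to zero, which is where the truncation problems mentioned
above arise.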

Let us look at the King method for two particular data bases. The
first is in connection with a penalty jury decision in California, and
the second is the iris data we discussed previously.

Individuals convicted of murder: 238 individuals convicted of first-degree
murder in California over a recent ten-year period were studied on the
basis of 25 measurements each as to whether an association existed between
their 25-dimensional descriptions and the penalty decision that resulted
in life imprisonment for 135 and capital punishment for 103. These 25
variables consisted of biographical information on the individual,
description of the crime, information on defense counsel, the prosecution,
and the judge. A King step-wise clustering procedure was employed to cluster
the 238 individuals and then seek a substantive association, if any,
between the characteristics of the individual, characteristics of the
crime, judicial process, and the penalty decision. My thanks for the data
under analysis go to several Law Review students at Stanford with whom I
worked on this study. One of their major concerns was to see if there
were any association between the penalty decided upon by a jury, which
under the law is given no instruction on standards to be employed in
arriving at a decision, and socio-economic characteristics or racial and
ethnic background of the individual. The clustering printout did not
reveal any significant associations between penalty and whether the
defendant was black, Mexican-American, or white; or whether the defendant
was a blue-collar worker or not. At the 58th pass, there was one
significant group that contained 18 members, all of whom had received
the life penalty. As the number of passes increased, this group remained
the principal group until the last few passes. At the 75th step the
group contained 34 members, of whom 30 received life imprisonment. At
the 100th step the group contained 42 life cases out of 62 members, and
at the 125th step, the group contained 63 life cases out of 102 members,
a 62 to 38 percent mixture for all 238 cases. What we seem to be getting
is clustering indicating very little or no association of penalty with
defendant and judicial characteristics. This may also have judicial
implications; for a penalty jury is, in effect, tossing for each defendant
a coin which lands head or tail in a 55 to 45 percent ratio.

Irises: In Fisher's well-known paper on the linear discriminant function,
he employed three groups of irises, each containing 50 members. Sepal
width and length, petal width and length were obtained for each of the 150
irises: 50 Iris Setosa, 50 Iris Virginica, 50 Iris Versicolor. We will
assume only that we have 150 irises represented as points in a
four-dimensional space which we wish to cluster by the King step-wise
clustering scheme. The results are interesting. The Iris Setosa are quite
different from the other two, which overlap a great deal. Thus we find at
the 137th pass that there is a cluster of 48 members, each an Iris Setosa;
there are four clusters containing 23, 24, 17, and 24 members respectively,
with 12, 4, 16, and 18 Iris Versicolor respectively, all demonstrating the
natural overlap between Iris Versicolor and Iris Virginica. At the very
next pass (138th) the two groups with 24 members each merge into a group
with 48 members, 22 Iris Versicolor and 26 Iris Virginica. Thus when
there is real and decided overlap the step-wise clustering scheme reflects
it; but if we did not know of the original three groups, we would be hard
pressed for a decision, and obviously would have to resort to additional
techniques, or expertise, or both.

These data bases and several others are discussed in a paper by
Solomon [11]. In that paper some computer printouts for the King procedure
are displayed.

Data Representation Techniques

An interesting idea in multivariate data analysis has been proposed
by Chernoff [1]. It is a graphical data representation technique. In
his procedure Chernoff transforms multidimensional vectors into human
faces. Thus, for example, several hundred vectors are transformed into
several hundred faces and the faces are then classified into groups
according to the similarity perceived by the classifier. The theme here
is that we are very familiar through experiences in life in classifying
facial characteristics. In his paper Chernoff presents a computer program
which handles up to 18 dimensional vectors. The reader is referred to
his paper for more details.

Up to this point, we have mentioned factor analysis but not said
much about it. There is an extensive literature on this subject. Its
current use in multivariate data analysis is from the representation
point of view. Computer libraries have factor analysis programs which can
take large order correlation matrices and obtain principal component
solutions. In this way a large number of measurement variables, say 50
to 100, can be transformed into many fewer variables, say on the order
of 5 to 10. Classification and clustering can then be applied to
multidimensional vectors of small order. A real payoff occurs when the
largest two or three factors are employed, because a graphical display
can then be arranged. When this occurs, clustering or classification
of the data points can be achieved by eye. See Solomon [11] for more
details.


9.  Multidimensional Contingency Table Analysis

A multivariate data analysis technique which is receiving more
attention these days is that of multidimensional contingency table
analysis (logistic response analysis). A number of authors (e.g.,
Kullback [8,9] and Goodman [6], among others) have done fundamental work
on this technique. We will discuss this model by illustrating its use
to study reenlistment decision in the armed services. The data stem
from some recent Marine Corps analyses.

In this section the structure underlying contingency table analysis
is discussed, and the mechanics of obtaining odds and probabilities for
the reenlistment decision are illustrated. The reenlistment analysis
is based on a large number of categorical variables. Regression analysis
and similar multivariate techniques for continuous variables become
inefficient and inappropriate for this situation. Multidimensional
contingency table analysis, which we now explore, is more suitable.

We are interested in accounting for the variation in reenlistments
in a parsimonious way and with meaningful factors. Consider a simple
example with two factors: reenlistment decision and rank. Assume rank
is categorized into two levels, i.e., high rank or low rank. The
reenlistment decision and rank of forty individuals might produce the
table

                     High Rank    Low Rank
    Reenlistment         10          10
    No Reenlistment      10          10

which yields probability estimates

                     High Rank    Low Rank
    Reenlistment        .25         .25
    No Reenlistment     .25         .25

or more generally

                     High Rank    Low Rank
    Reenlistment        p11         p12
    No Reenlistment     p21         p22

The overall probability that a person reenlists is $p_{11} + p_{12} = .5$.
The probability that a reenlistment is of high rank is also .5 for

    $$\frac{p_{11}}{p_{11} + p_{12}} = \frac{.25}{.25 + .25} = .5 .$$

In this example, the probabilities of reenlistment are the same regardless
of rank. This table suggests reenlistment decision and rank are independent.

A related measure denoted as an "odds" measure has an interpretation
well known to bettors. In the above example, if one wagers that a
person selected at random reenlists, the overall odds, i.e., the odds of
reenlistment regardless of rank, are one to one or even. Knowledge that
the bet is on the high rank group or low rank group does not change the
odds. Realistically, however, the probability and odds that a high rank
and a low rank will reenlist are not the same. As an illustration,
consider the table

                     High Rank    Low Rank
    Reenlistment         15           5
    No Reenlistment       5          15

This gives probability estimates

                     High Rank    Low Rank
    Reenlistment        .375        .125
    No Reenlistment     .125        .375

From this table the overall probability of a person reenlisting,
.375 + .125 = .5, remains the same but the probability that a high rank
reenlists is

    $$\frac{.375}{.375 + .125} = .75 .$$


This differs substantially from the overall probability of 0.5 which
no longer summarizes the data. The odds will change as well, being three
to one for high rank, one to three for low rank. The information
contained in this and the preceding table is described in terms of three
characteristics: the overall probability that a person will reenlist,
the probability that a low rank will reenlist, and the probability that
a high rank will reenlist.
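
The arithmetic for the two forty-person tables can be reproduced with a
short Python sketch. The function names and the list-of-rows layout (row 0
for reenlistment, row 1 for no reenlistment; column 0 for high rank, column
1 for low rank) are choices made here for illustration only.

```python
def cell_probabilities(table):
    """Divide every cell count by the grand total."""
    total = sum(sum(row) for row in table)
    return [[count / total for count in row] for row in table]


def reenlist_probability(table, rank):
    """Probability of reenlisting within one rank column."""
    return table[0][rank] / (table[0][rank] + table[1][rank])


def odds(probability):
    """Odds in favor of an event with the given probability."""
    return probability / (1 - probability)


independent = [[10, 10], [10, 10]]   # first table in the text
dependent = [[15, 5], [5, 15]]       # second table in the text

overall = sum(cell_probabilities(dependent)[0])   # .375 + .125
high = reenlist_probability(dependent, 0)         # high rank
low = reenlist_probability(dependent, 1)          # low rank
```

Here odds(high) is three to one and odds(low) is one to three, while both
ranks give even odds for the first table.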

The basic objective in a more complex table is to identify the
minimum number of probabilities that must be specified to adequately
describe the table. The specification of probabilities given in the
last example can be used. However, recent research has developed a more
formal descriptive model similar to analysis of variance or regression
models. Instead of dealing directly with cell probabilities, it is
convenient to deal with their logarithms. These new variables, the
logarithms of the cell probabilities, have characteristics similar to
measurement data, and they can be incorporated into a linear model
whose parameters indicate the contribution of the various factors and
their interactions to the cell probability.

The linear model for estimating logarithms of $p_{tk}$ (for our analysis
where we fix and employ only the marginals) is

    (9.1)  $\ln p_{tk} = \mu + \alpha^{T}_{t} + \alpha^{K}_{k} + \alpha^{TK}_{tk}$,   t = 1, 2;  k = 1, 2

where $\ln p_{tk}$ is the natural logarithm of $p_{tk}$. The constant $\mu$ is a
general mean indicating the average value of $\ln p_{tk}$. The parameter
$\alpha^{T}$ indicates the "effect" of reenlistment decision on $\ln p_{tk}$
independent of rank; $\alpha^{K}$ measures the effect of rank on $\ln p_{tk}$
independent of reenlistment decision. The parameter $\alpha^{TK}$ measures the
interaction effect of reenlistment decision and rank on $\ln p_{tk}$. For the
first example cited, where all the $p_{tk}$ (and consequently all the
$\ln p_{tk}$) are equal, $\alpha^{T}$ and $\alpha^{K}$ are zero since $\ln p_{tk}$
does not vary with either reenlistment decision or rank, and for this
example $\alpha^{TK}$ is zero as well.

Hence, $p_{tk}$ is equal to the anti-log of $\mu$, which in this case is the
overall probability that a person reenlists.


The model in (9.1) allows the step-by-step computation of cell
probabilities similar to regression analysis. For example, if reenlistment
decision is considered as a function of rank, the odds of reenlistment
(t = 1) to non-reenlistment (t = 2) for a given rank are

    $$\frac{p_{1k}}{p_{2k}}$$,  say k = 1 for high rank, k = 2 for low rank.

Using the model in (9.1) to obtain these odds in logarithmic form
(denoted hereafter as the log odds), we get

    (9.2)  $\ln \frac{p_{1k}}{p_{2k}} = (\mu + \alpha^{T}_{1} + \alpha^{K}_{k} + \alpha^{TK}_{1k}) - (\mu + \alpha^{T}_{2} + \alpha^{K}_{k} + \alpha^{TK}_{2k}) = 2\alpha^{T}_{1} + 2\alpha^{TK}_{1k}$

where $\alpha^{T}_{1} = -\alpha^{T}_{2}$ and $\alpha^{TK}_{1k} = -\alpha^{TK}_{2k}$.

Since the $\alpha$ parameters measure deviations from a general mean, a
deviation from the mean at one level leads to a deviation in the opposite
direction at the other level. Replacing $2\alpha^{T}_{1}$ and $2\alpha^{TK}_{1k}$
by $\beta^{T}$ and $\beta^{TK}_{k}$ to simplify the notation in (9.2) yields

    (9.3)  $\ln \frac{p_{1k}}{p_{2k}} = \beta^{T} + \beta^{TK}_{k}$,  k = 1 for high rank, k = 2 for low rank.

From (9.3) the log odds of reenlistment to non-reenlistment depend
on $\beta^{T}$, the general mean for the log odds, and $\beta^{TK}_{k}$, the
relationship between rank and reenlistment decision.
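
As an illustration, the two parameters of the log-odds form can be computed
directly from a 2 x 2 table of counts. The sketch below assumes the usual
deviation constraint, so that the two rank terms are equal in size and
opposite in sign; the function name and the table layout (row 0 for
reenlistment, column 0 for high rank) are chosen here for illustration.

```python
import math


def log_odds_parameters(table):
    """Return (general mean, [rank terms]) for the log odds of a 2 x 2 table."""
    # Log odds of reenlistment to non-reenlistment within each rank.
    log_odds = [math.log(table[0][k] / table[1][k]) for k in range(2)]
    general_mean = sum(log_odds) / 2
    # Each rank term is the deviation of that rank's log odds from the mean.
    rank_terms = [value - general_mean for value in log_odds]
    return general_mean, rank_terms


mean_indep, terms_indep = log_odds_parameters([[10, 10], [10, 10]])
mean_dep, terms_dep = log_odds_parameters([[15, 5], [5, 15]])
```

For the table of equal counts every parameter is zero. For the 15, 5, 5, 15
table the general mean is still zero, and the rank terms are ln 3 and -ln 3,
the logarithms of the three-to-one and one-to-three odds found earlier.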

To further illustrate these ideas, let us consider another example.
Assume that reenlistment is dependent on two variables: length of
enlistment, L, and the presence or absence of dependents, D. Then
$p_{tld}$ represents the probability that a specified reenlistment decision
is made given an individual's length of enlistment and dependency status.
Following the previous example, the logarithm of the odds of reenlisting
to not reenlisting as a function of the predictor variables can be
written as

    (9.4)  $\ln \frac{p_{1ld}}{p_{2ld}} = \beta^{T} + \beta^{TL}_{l} + \beta^{TD}_{d} + \beta^{TLD}_{ld}$ .


Each one of the $\beta$ parameters has the same interpretation given
previously. $\beta^{T}$ is a general mean for the log odds. The
$\beta^{TL}_{l}$, l = 1 (two year enlistment), l = 2 (three year enlistment),
l = 3 (enlistment of four or more years) are numerical measures of the
impact on reenlistment of enlistment length. Similarly, the $\beta^{TD}_{d}$
are numerical measures of the impact of dependents on reenlistment where
the subscript d identifies the number of dependents, d = 1 (no
dependents), d = 2 (one or more dependents). The parameters
$\beta^{TLD}_{ld}$ are interaction terms. It may be, for example, that the
presence of dependents may influence the reenlistment decision of four
year enlistees differently than that of three or two year enlistees.
First, dependents are more common among four year enlistees and they tend
to have more of them. Second, four year enlistees who serve to end of
term tend to be older at the time they must decide whether to reenlist.
Hence the impetus to reenlist may be greater among members of this group
than would be indicated by adding the separate effects of dependency
status and length of enlistment. The presence of a joint interaction
effect of length of enlistment and dependency status on reenlistment
implies a non-zero $\beta^{TLD}_{32}$.

By exponentiation of each side of the log-linear model (9.4), the
odds of reenlisting to not reenlisting (hereafter referred to simply as
the odds of reenlistment) can be written in the form

    (9.5)  $\frac{p_{1ld}}{p_{2ld}} = \delta^{T}\, \delta^{TL}_{l}\, \delta^{TD}_{d}\, \delta^{TLD}_{ld}$

where the $\delta$'s are the anti-logs of the $\beta$'s. In this form of the
model, $\delta^{T}$ can be interpreted as the overall mean odds of reenlistment
which is modified by more detailed information about the levels or values
of the predictor variables and their interactions.

For the full model, the overall odds $\delta^{T}$ is estimated as

    $$\hat\delta^{T} = e^{\hat\beta^{T}} = e^{-2.60} = .074$$

that is, the odds are .074 to one in favor of reenlistment.* If the
odds of reenlistment are desired for Marines who enlist for four years,
we need to compute

    $$\hat\delta^{T} \hat\delta^{TL}_{3} = (.074)(2.46) = .182 .$$


*Note that this is not the odds that would be computed directly from
the observations, but rather from their logarithmic transforms, then
averaging, then transforming back to the odds domain. Thus, this "mean
odds" is a multiplicative mean, not an additive mean.

Thus, the odds of reenlistment increase from .074 to .182 for Marines
who enlist for four years.
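
The multiplicative structure is easy to check numerically. The following
sketch uses only the two estimates quoted in the text (a general mean log
odds of -2.60 and an odds factor of 2.46 for four-year enlistees); the
function names are hypothetical. It also illustrates the footnote: the
anti-log of an average of log odds is a multiplicative (geometric) mean of
the odds, not their arithmetic average.

```python
import math


def odds_for(mean_odds, *factors):
    """Multiply the overall mean odds by each applicable odds factor."""
    result = mean_odds
    for factor in factors:
        result *= factor
    return result


def multiplicative_mean(odds_values):
    """Anti-log of the average log odds."""
    logs = [math.log(value) for value in odds_values]
    return math.exp(sum(logs) / len(logs))


mean_odds = math.exp(-2.60)            # about .074
four_year = odds_for(0.074, 2.46)      # text rounds this to .182

# Hypothetical odds of 1:2 and 2:1 have multiplicative mean 1, not 1.25.
example_mean = multiplicative_mean([0.5, 2.0])
```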




The calculation can be extended, for example, to Marines who enlist
for four years who have one or more dependents by the end of their
enlistment period. If these independent variables entered linearly in
the model, the estimated odds for reenlistment would be given by
$\hat\delta^{T} \hat\delta^{TL}_{3} \hat\delta^{TD}_{2}$, but since dependency
status and length of enlistment are found to interact jointly on
reenlistment, the odds of reenlistment for this group of individuals are
given by

(9.6)  6T  ^ - (.074)  (2.46)  Cl-72)  U**&>  “ -*57  » 

where the last term measures the interaction effect of L and D.
Note, the odds of reenlistment for four year enlistees with one or more
dependents would have been substantially underestimated if the first
order interaction effect had been omitted from the calculation.

As can be seen from this example, the estimation of a small number
of $\delta$'s permits the computation of odds of reenlistment for individuals
having very diverse characteristics. It should be noted that as in the
case of regression analysis, the coefficients of the linear model (9.4)
(and consequently the $\delta$'s in (9.6)) show the effect of a change in a
variable holding all the other variables constant. Thus $\hat\delta^{TL}_{3}$
measures the direct effect of length of enlistment on the odds of
reenlistment. If an indirect effect with dependency status is also
present, this is measured by $\hat\delta^{TLD}_{32}$. Both the direct and
indirect effects of length of enlistment are net of the effects of other
variables such as rank, education, race, etc. That is, the effects of
variation in the latter variables on the odds of reenlistment are taken
into account in the computation of $\hat\delta^{TL}_{3}$ and
$\hat\delta^{TLD}_{32}$.

Given the odds of reenlistment for individuals with a given set of
characteristics, it is a simple matter to compute the probability of
reenlistment for the group from the relationship

    (9.7)  odds of reenlistment $= \frac{p}{1 - p}$ .

For example, if the probability of reenlisting, p, is .07, then
the probability of not reenlisting, 1 - p, is .93, and the odds of
reenlistment are .074 to one. Solving for p in (9.7) yields

    (9.8)  Probability of reenlisting $= \frac{\text{odds of reenlistment}}{1 + \text{odds of reenlistment}}$ .
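
The conversion between odds and probability is a one-line computation in
each direction. A minimal sketch, with function names chosen here for
illustration:

```python
def probability_to_odds(p):
    """Odds in favor of an event with probability p."""
    return p / (1 - p)


def odds_to_probability(odds):
    """Probability of an event whose odds in favor are `odds` to one."""
    return odds / (1 + odds)


p_overall = odds_to_probability(0.074)      # close to .07
p_four_year = odds_to_probability(0.182)    # four-year enlistees
```

The two functions are inverses of one another, so converting a probability
to odds and back returns the original value up to rounding.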


In these calculations it is important to distinguish between
individual $\delta$'s referred to as "odds factors" (e.g., $\delta^{TL}$,
$\delta^{TD}$, $\delta^{TLD}$) which indicate how the overall mean reenlistment
odds, $\delta^{T}$, is modified and the product of $\delta$'s (e.g.,
$\delta^{T} \delta^{TL} \delta^{TD} \delta^{TLD}$) which measures the
odds of reenlistment for individuals with a specified set of
characteristics. Since (9.8) converts the odds of reenlistment for a given
group of individuals to the probability of reenlistment for that group, it
cannot be applied to the individual $\delta$'s.

The above discussion makes clear that a large number of parameters
may enter the contingency table model, thus raising the problem of
identifying which parameters are to be included in a model and which are
to be excluded. Statistical distribution theory and a measure I*, which
is similar to $R^2$, the multiple correlation coefficient in regression
analysis, is used to resolve this problem.

In regression analysis the explanatory value of a set of predictor
variables is measured by the percentage of variation in the dependent
variable explained by the predictor variables. The base measure of
variation in regression analysis is the sum of squares about the mean of
the dependent variable, i.e., $\sum (Y_i - \bar{Y})^2$. As predictor variables
are added to the model, the predicted values of the dependent variable,
$\hat{Y}_i$, are used to measure the amount of variation,
$\sum (\hat{Y}_i - \bar{Y})^2$, explained. The percentage of base variation
explained is then

    $$100\,R^2 = 100\,\frac{\sum (Y_i - \bar{Y})^2 - \sum (Y_i - \hat{Y}_i)^2}{\sum (Y_i - \bar{Y})^2}$$

One method of measuring the contribution of any particular variable is
the change in $R^2$ when that predictor variable is added to the model.
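
A short sketch of this percentage, assuming the observed and predicted
values are supplied as plain lists (the function name and the sample numbers
are hypothetical):

```python
def percent_variation_explained(observed, predicted):
    """100 * (base sum of squares - residual sum of squares) / base."""
    mean = sum(observed) / len(observed)
    base = sum((y - mean) ** 2 for y in observed)
    residual = sum((y - f) ** 2 for y, f in zip(observed, predicted))
    return 100 * (base - residual) / base


y = [1.0, 2.0, 3.0, 4.0]
perfect = percent_variation_explained(y, y)              # predictions match
no_model = percent_variation_explained(y, [2.5] * 4)     # predict the mean
```

Predicting the mean for every case explains none of the base variation,
while predictions that reproduce the observations explain all of it.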

For contingency tables, the base measure of variation is computed
either as the chi-square statistic*

    $$\sum \frac{(O - E)^2}{E}$$

or the information measure

    $$2 \sum O \ln \frac{O}{E}$$

under the hypothesis that all $\beta$ parameters in (9.4) except the general
mean are zero. I* is then the percentage of base variation explained
by the introduction of some collection of $\delta$ parameters into the model,
i.e.,

    $$I^* = \frac{\left(\sum O \ln \frac{O}{E}\right)_{\text{Base}} - \left(\sum O \ln \frac{O}{E}\right)_{\text{Model}}}{\left(\sum O \ln \frac{O}{E}\right)_{\text{Base}}}$$

*The symbol O stands for the observed cell count and E the
estimated cell count. The summation is over all cells in a table.


In practice, an I* of 70 percent or better is desired. Sometimes a
lower value is acceptable because increasing I* requires the addition
of many interaction parameters with the consequent difficulty of
interpretation. The prime objective is to find the most important
parameters. When the number of observations is large, parameters
signifying marginal impact will be statistically significant. Thus we may
adopt a convention, say, of excluding parameters when they increase I* by
less than two percentage points.

SIMPLIFIED  MARKOV  CHAIN  METHOD


Richard  M.  Brugger
US  Army  Armament  Command 
Product  Assurance  Directorate 
RAM  Assessment  Division 
Rock  Island,  Illinois 


ABSTRACT.  Skip-lot  procedure  formulations 
have  previously  been  carried  out  by  using 
complicated  and  tedious  Markov  chain  methods. 

This  paper  describes  a very  short  formulation 
method  using  a simplified  Markov  chain 
approach  which  was  developed  by  the  author 
for  continuous  sampling  plan  formulations, 
but  which  has  application  to  the  skip-lot 
problem  also. 

It is common practice in quality assurance to use
a sampling plan to determine whether a lot of units
should be accepted or not. For example, we might have
a lot of size 100, and draw a sample of size seven.
From this sample, we will make an inference about the
lot, thereby enabling us to make a decision about what
we should do with the lot: should we accept the lot or
should we reject? The sampling plan will help us make
this decision by providing us with the decision criteria.
For example, we might have an attributes-type plan,
whereby some characteristic of the unit of product, its
paint job, for example, is judged to be either good or
bad. Perhaps the sampling plan permits one of the seven
units to be defective with the lot still being acceptable,
but specifies that if two or more units are defective
we must reject the lot.

Another general kind of plan is a variables plan,
whereby some dimensional property of the unit, for
example, diameter, is determined, and this measurement
is included with other measurements to determine perhaps
a sample mean and standard deviation, with which
suitable tables can be consulted to determine whether
the lot should be accepted or not.



In what will follow, we are not especially concerned
about what kind of lot sampling plan we are dealing with,
since our discussion will apply to any lot sampling plan.
We don't care if it is by attributes or variables, nor
whether the plan is single, double, multiple, sequential,
or what have you, with the sole requirement that it must
be a plan whereby a decision is made on a lot from an
inference reached in a sample.

Now let's take a look at a skip-lot procedure.
Figure 1 provides an example of a very simple one, patterned
after the continuous sampling plan CSP-1 of
Harold Dodge [4, 5]. Note that the rules of the procedure
tell us that if five consecutive lots are accepted,
we may thereafter use some probabilistic device such as
dice or random numbers to determine whether we should
inspect a lot; we want the probability that a lot will
be inspected in this example to be one fourth. Note
that we have also provided the cautionary statement
"provided there are no indications that factors are present
which would have caused homogeneity to be lost".
Obviously, if we found that a serious machine malfunction
had developed while a lot was being run through it,
we would not want to skip the inspection on the lot. If
homogeneity is lost, or if a lot is rejected, we return
to the 100% phase, after which the cycle goes on and on.
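
The rules just stated can be tried out with a small simulation. This is
only a sketch under stated assumptions: every inspected lot is taken to be
accepted independently with the same probability, loss of homogeneity is not
modeled, and the function names, the seed, and the acceptance probability of
0.9 are invented for illustration. For comparison, the sketch also evaluates
the well-known average fraction inspected of Dodge's CSP-1,
f / (f + (1 - f) P^i), with lots playing the role of units.

```python
import random


def simulate_skip_lot(p_accept, clearance=5, fraction=0.25,
                      lots=200_000, seed=1):
    """Return the fraction of lots inspected over a long run."""
    rng = random.Random(seed)
    consecutive = 0      # accepted lots in a row during the 100% phase
    skipping = False     # True once the clearance number has been reached
    inspected = 0
    for _ in range(lots):
        if skipping and rng.random() >= fraction:
            continue     # this lot is skipped
        inspected += 1
        if rng.random() < p_accept:
            if not skipping:
                consecutive += 1
                if consecutive >= clearance:
                    skipping = True
        else:
            # A rejected lot sends us back to the 100% phase.
            skipping = False
            consecutive = 0
    return inspected / lots


def csp1_fraction_inspected(p_accept, clearance=5, fraction=0.25):
    """Closed-form average fraction inspected for a CSP-1 type plan."""
    return fraction / (fraction + (1 - fraction) * p_accept ** clearance)


simulated = simulate_skip_lot(0.9)
closed_form = csp1_fraction_inspected(0.9)
```

With five consecutive acceptances required and an inspection probability of
one fourth, the simulated and closed-form fractions should agree closely for
a long run of lots.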

In this example then, we see what the skip in skip-lot
means: we skip the inspection or testing of some
lots. Why do we want to do this? The economic factor
is usually the significant reason. For example, in using
a skip-lot procedure for ballistic testing, over seven
and one half million dollars were saved by the Army in
the period from 1966 through 1973. This is described by
Charles E. Stine [11]. Since skip-lot procedures are
analogous to continuous sampling plans, much of the
mathematical theory of continuous sampling applies also to
skip-lot procedures. The first work in continuous
sampling was carried out by Dodge. As the types of plans
generated over the years became more and more complicated,
the direct algebraic approaches of Dodge were not
sufficient for determining such properties of interest as
average fraction inspected curves and average outgoing
quality curves.

FIGURE 1.  [Flow chart of a simple skip-lot procedure; the only text
recovered from the figure is the proviso "... factors are present which
would cause homogeneity to be lost."]


These problems were overcome with the introduction
by Lieberman and Solomon [8] of the theory of Markov
chains into the continuous sampling plan area. Markov
chain methods were thereafter used a great deal in problems
in continuous sampling plan theory; their methods
then logically carried over into problems of skip procedure
theory. As a matter of fact, Allen Endres [6], an
employee of mine at the time, presented a paper at the
Thirteenth Conference on the Design of Experiments using
Markov chains to determine the mathematical properties
of a rather complicated skip procedure. Two recent
papers by Perry [9, 10] in the Journal of Quality Technology
make use of Markov chain methods to describe skip-lot
procedures.
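
To indicate what a Markov chain treatment involves, here is a sketch for
the simple procedure of Figure 1. It is the ordinary stationary-distribution
calculation, not the simplified method of this paper, and the choice of
states is an assumption made for illustration: states 0 through i - 1 count
the consecutive accepted lots during 100% inspection, and state i is the
skipping phase. Lots are assumed to be accepted independently with a common
probability. All names are hypothetical.

```python
def skip_lot_matrix(p_accept, clearance=5, fraction=0.25):
    """Transition matrix over states 0..clearance (last state = skipping)."""
    n = clearance + 1
    matrix = [[0.0] * n for _ in range(n)]
    for j in range(clearance):
        matrix[j][j + 1] += p_accept        # one more accepted lot
        matrix[j][0] += 1 - p_accept        # rejection restarts the count
    leave = fraction * (1 - p_accept)       # inspected and rejected
    matrix[clearance][0] += leave
    matrix[clearance][clearance] += 1 - leave
    return matrix


def stationary_distribution(matrix, iterations=5000):
    """Approximate the long-run state probabilities by repeated multiplication."""
    n = len(matrix)
    dist = [1.0 / n] * n
    for _ in range(iterations):
        dist = [sum(dist[r] * matrix[r][c] for r in range(n))
                for c in range(n)]
    return dist


def average_fraction_inspected(p_accept, clearance=5, fraction=0.25):
    """Every lot is inspected in states 0..i-1; a fraction f in state i."""
    pi = stationary_distribution(
        skip_lot_matrix(p_accept, clearance, fraction))
    return sum(pi[:clearance]) + fraction * pi[clearance]


afi = average_fraction_inspected(0.9)
```

For this simple plan the result agrees with the closed form
f / (f + (1 - f) P^i); for more elaborate procedures the number of states
grows quickly, which is the labor the text goes on to describe.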

While the Markov chain method permits solution of
complex problems, it still involves quite a bit of work.
In 1970, after working for several years in the area of
plan development and problem solution in continuous
sampling plans, the author developed a simplifying
algorithm [1]. Although it is described in terms of a
continuous sampling plan, it applies also to a skip-lot
procedure. The necessary derivations and justifications
are provided in [1], so there is no need to go through
all of it here. Instead, I'd like to give a short
philosophical explanation of what the simplified Markov chain
method is about, and a short example of how it works.

For our example, let's use what Perry [10] called
the 2L.2 procedure, which he made analogous to a continuous
sampling plan investigated by Guthrie and Johns [7],
who obtained the plan from a family of plans developed
by Derman, Littauer, and Solomon [3]. This is shown on
Figure 2.

The rules of the procedure are as follows: Start
with normal inspection, inspecting every lot. When i
consecutive lots are accepted on normal inspection, switch
to skipping inspection at rate f1. If we now have i
consecutive lots accepted, we go to rate f2, but if a lot is
rejected, we return immediately to normal inspection.
While we are at rate f2, we return to normal inspection
whenever a lot is rejected.
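
The switching rules above can be sketched as a small simulation. This is only an illustration under stated assumptions, not code from Perry or from the author: the function name simulate_2l2 is hypothetical, P is taken to be the probability that an inspected lot is accepted, and "skipping at rate f" is read as inspecting each lot independently with probability f.

```python
import random


def simulate_2l2(P, i, f1, f2, n_lots, seed=0):
    """Simulate the skip-lot rules; return (fraction accepted, lots per phase).

    Assumption: at skipping rate f each lot is inspected with probability f,
    and every skipped lot is accepted.
    """
    rng = random.Random(seed)
    phase, run = "N", 0            # N = normal, FSL / SSL = first / second skipping level
    accepted = 0
    counts = {"N": 0, "FSL": 0, "SSL": 0}
    for _ in range(n_lots):
        counts[phase] += 1
        if phase == "N":
            inspected = True
        elif phase == "FSL":
            inspected = rng.random() < f1
        else:
            inspected = rng.random() < f2
        if not inspected:
            accepted += 1          # skipped lots pass without inspection
            continue
        if rng.random() < P:       # inspected lot accepted
            accepted += 1
            if phase in ("N", "FSL"):
                run += 1
                if run == i:       # i consecutive acceptances: move up one level
                    phase = "FSL" if phase == "N" else "SSL"
                    run = 0
        else:                      # any rejection returns to normal inspection
            phase, run = "N", 0
    return accepted / n_lots, counts
```

For example, simulate_2l2(0.9, 4, 0.5, 0.25, 20000) returns an estimate of the long-run fraction of lots accepted together with the number of lots that fell in each phase.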




The next step in the usual Markov chain approach is
to set up the transition probability matrix, by considering
each lot to be represented by a state in a Markov
chain. For example, Perry's transitional probability
matrix is shown in Table 1, where

NR denotes lot rejection on normal inspection.

Nj denotes that the number of consecutively accepted lots
during normal inspection is j (j = 1, 2, ..., i).

S1Aj denotes that the number of consecutively inspected and
accepted lots during skipping inspection at rate f1
is j (j = 1, 2, ..., i).

S1R denotes lot rejected during skipping inspection at
rate f1.

S1Nj denotes lot skipped during skipping inspection at
rate f1, where the previous number of inspected and accepted
lots on skipping inspection at rate f1 is
j (j = 0, 1, ..., i-1).

S2A denotes lot inspected and accepted during skipping
inspection at rate f2.

S2R denotes lot rejected during skipping inspection at
rate f2.

S2N denotes lot skipped during skipping inspection at
rate f2.

The simplified Markov chain approach restructures
the problem by defining what the Markov chain represents.
Under the old Markov chain method, each lot is
represented by a state in a finite Markov chain. Under the
simplified method, we might imagine that we have
collected all lots occurring consecutively in any given
phase of the procedure in big boxes, where the size of
the box is unlimited. The labels on the boxes represent
the phases of the procedure. The states of the Markov
chain represent the labels on the box. In our example,
since we have only three kinds of labels, normal, first
skipping level, and second skipping level, we have a
Markov chain with only three states. Our concern a
little later will be with the expected number of lots
in a box with a given label.

Table 2 shows the transition probability matrix
using the simplified Markov chain approach.


TABLE 2
TRANSITION MATRIX FOR SIMPLIFIED MARKOV CHAIN

            N            FSL          SSL

N           -            1*           -

FSL         1 - P^i      -            P^i

SSL         1*           -            -

*For P < 1

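The resulting state probability equations are

\[ N = (1 - P^i)\,\mathrm{FSL} + \mathrm{SSL} \tag{1} \]

\[ \mathrm{FSL} = N \tag{2} \]

\[ \mathrm{SSL} = P^i\,\mathrm{FSL}. \tag{3} \]

Solving for each of the state probabilities in terms of
one of them, we have

\[ N = N \tag{4} \]

\[ \mathrm{FSL} = N \tag{5} \]

\[ \mathrm{SSL} = P^i N. \tag{6} \]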

Our interest now is in the coefficients in the resulting
equations. These are brought over into column one of our
working table, Table 3.


TABLE 3
WORKING TABLE

        1        2        3                   4                   5
        COEF.    SIMP.    EX. NO.             SIMP.               PROD.

N       1        1        (1-P^i)/(Q P^i)     f1 f2 (1-P^i)       f1 f2 (1-P^i)

FSL     1        1        (1-P^i)/(f1 Q)      f2 P^i (1-P^i)      f2 P^i (1-P^i)

SSL     P^i      P^i      1/(f2 Q)            f1 P^i              f1 P^(2i)

This is our working table needed to complete the
solution. Column one lists the coefficients we just mentioned.
Column two provides for simplifying column one by
clearing denominators or dividing by common factors. Any
operation carried out on one element of a column must be
carried out on each of the other elements simultaneously.
In this case, there is nothing that can be simplified, so
column two is the same as column one. Column three
contains expressions for the expected number of lots
contained in the boxes with the respective labels.
Expressions for the various kinds of phases one would expect
to encounter are contained in [1]. Column four serves
to simplify column three, in the same way that column
two is intended to simplify column one. In this case,
we see that we can clear the denominator by multiplying
each element in column three by f1 f2 Q P^i. Column five is
the product of columns two and four.


Let the sum of the column five elements equal D.
Then

\[ N = f_1 f_2 (1 - P^i)/D \tag{7} \]

\[ \mathrm{FSL} = f_2 P^i (1 - P^i)/D \tag{8} \]

\[ \mathrm{SSL} = f_1 P^{2i}/D \tag{9} \]


We are now ready to determine our long-run operating
characteristic curve, which is defined in terms of
stationary probabilities as

\[ P_a = P\,[N + f_1\,\mathrm{FSL} + f_2\,\mathrm{SSL}] + (1 - f_1)\,\mathrm{FSL} + (1 - f_2)\,\mathrm{SSL} \tag{10} \]

Notice that we are saying that inspected lots are accepted
with probability P while all skipped lots are accepted.

Substituting expressions for N, FSL, and SSL and
carrying out a few algebraic operations leads to Perry's
solution

\[ P_a = \frac{f_2\,[P^i + f_1 (P - P^i)] + (f_1 - f_2) P^{2i}}{f_2\,[P^i + f_1 (1 - P^i)] + (f_1 - f_2) P^{2i}} \]
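
The working-table calculation can be checked numerically. The sketch below is an illustration only: the function names are hypothetical, Q is assumed to mean 1 - P (the probability that an inspected lot is rejected), and 0 < P < 1 is required. It multiplies the column-one coefficients by the column-three expected numbers of lots, normalizes by their sum, applies equation (10), and compares the result with the closed form. The column-four simplification is skipped because normalizing makes it unnecessary in floating point.

```python
def working_table_pa(P, i, f1, f2):
    """Stationary fractions of lots in N, FSL, SSL and the resulting Pa."""
    Q = 1.0 - P                      # assumed: probability an inspected lot is rejected
    Pi = P ** i
    coef = {"N": 1.0, "FSL": 1.0, "SSL": Pi}              # column 1 (= column 2)
    lots = {                                              # column 3
        "N": (1 - Pi) / (Q * Pi),
        "FSL": (1 - Pi) / (f1 * Q),
        "SSL": 1 / (f2 * Q),
    }
    prod = {s: coef[s] * lots[s] for s in coef}           # coefficient x expected lots
    D = sum(prod.values())
    frac = {s: prod[s] / D for s in prod}                 # equations (7)-(9)
    pa = (P * (frac["N"] + f1 * frac["FSL"] + f2 * frac["SSL"])
          + (1 - f1) * frac["FSL"] + (1 - f2) * frac["SSL"])   # equation (10)
    return frac, pa


def closed_form_pa(P, i, f1, f2):
    """Closed-form Pa obtained by substituting (7)-(9) into (10)."""
    Pi, P2i = P ** i, P ** (2 * i)
    num = f2 * (Pi + f1 * (P - Pi)) + (f1 - f2) * P2i
    den = f2 * (Pi + f1 * (1 - Pi)) + (f1 - f2) * P2i
    return num / den
```

Calling working_table_pa(0.9, 4, 0.5, 0.25) and closed_form_pa(0.9, 4, 0.5, 0.25) should give the same acceptance probability up to rounding.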
SEMI MARKOV CHAINS APPLIED TO MARKOV CHAIN
MODELS OF CONTINUOUS SAMPLING PLANS

David L. Arp
Naval Weapons Center
China Lake, California


ABSTRACT. A method is presented for overcoming some
of the complexities of analyzing time-homogeneous,
irreducible, and finite state Markov Chain (MC) models of
Continuous Sampling Plans (CSP's), with potential
applications to other processes, by constructing from any such MC
a unique Semi Markov Chain (SMC). To this end, a class of
MC models is defined in terms of four different basic blocks
of MC states called phases which are naturally and
unambiguously defined by the MC structures considered. For a
phase of a given type, a time-of-sojourn probability density
function (p.d.f.) is derived for each possible exit. Any
phase, together with its p.d.f.'s, that occurs in a MC is
then treated as a SMC state. If self-jumps of phases are
forbidden, the SMC so constructed induces and is induced by
a unique Markov Renewal Process (MRP); otherwise the MRP
induces the SMC but not conversely.

This constructive technique, the Z-transform calculus,
and Renewal Theory are used to analyze at length, for the
Job Shop and Arbitrary Entry cases, the two most common
CSP's and the first two moments of the Fraction Inspected
functional defined on them. Variations in phase types and/or
their p.d.f.'s are considered resulting in, for a given
MC, variant SMC's which are in turn studied using the
concept of filtration.

For the Arbitrary Entry case, delayed p.d.f.'s are
defined in a renewal-theoretical way and in an intuitive
constructive way using an initial probability vector which
overtly depends on the entire MC model. These two
definitions are shown to be equivalent, thereby proving the latter,
along with certain probability ratios, to be purely
phase-type dependent. Using these delayed p.d.f.'s, it is
demonstrated that the resulting SMC and MRP are stationary.

Other  more  complex  standard  and  non-standard  plans 
are  also  dealt  with  briefly. 
1.0 INTRODUCTION. A prevalent classical treatment
of a class of Continuous Sampling Plans (CSP's) has been
the study of certain functionals defined on finite-state,
ergodic, and time-homogeneous Markov Chain (MC) models
which distinguish different types of groupings, called
phases, of the states involved, with screening (sc), unlimited
sampling (uls), limited sampling (ls), and checking (ck)
phases being the usual types [see Refs. 7.4 and 7.5].
These phases are in turn hooked together in various ways
and in varying numbers, in accordance with sampling
frequency criteria, to generate the plans making up this
class. Moreover, time is operational and is discretely
measured by "number of production units". Of primary
importance in measuring the effectiveness of such a
sampling plan is the functional Fraction Inspected which
can be defined as follows:


FI  (N) 


j-0  (SN) 


MSN^> 


Where 

N * number  of  units  (or  run  number) , 

M,N(j)  - t1  if  hSN<3)  - SN 

0 , otherwise 


and  SN  varies  over  all  the  non-inspection  states  of  the 
corresponding  MC  model.  In  deriving  formulas  for  the 
moments  of  FI(N),  only  two  starting  conditions  are  of 
practical  importance;  the  Job  Shop  and  the  Arbitrary 
Entry.  In  the  former  case,  the  components  of  the  initial 
probability  vector  are  1 for  the  starting  state  of  a sc 
phase  and  zero  for  all  other  states  and,  for  the  latter 
case,  the  components  are  the  steady  state  or  long  run 
probabilities . 

To date, only the first moment of this functional has
been derived in the Job Shop case for an infinite run and,
since the long-run probabilities are also stationary, in
the Arbitrary Entry case as well (independent of N). In
an earlier paper by the author [Ref. 7.1], in order to
obtain, for the simplest and most heavily used of these
plans (CSP-1), the mean and variance of the above
functional in the short-run Job Shop case and its variance
for the Arbitrary Entry case, advantage was taken of the
sparseness and regularity of the transitional matrix of
the corresponding model to generate difference equations
for the salient transitional probability functions which
were in turn solved for by the Z-transform method.

Unfortunately, for plans more complex than CSP-1,
this method becomes less feasible because of the increased
difficulty in deriving the basic difference equations.
This situation arises from the accretion of MC state
relationships as the plan complexity increases despite
the fact that the transitional matrices still remain
relatively sparse. In addition, increasing complexity
makes it harder (1) to obtain bounds on the moments of
FI(N), (2) to study the growth properties of and
relationships between the transitional probability functions and
quantities derived from them (which is hard even for CSP-1),
(3) to obtain closed expressions and asymptotic expansions
for these quantities, and (4) to quantitatively analyze
structural differences among the various plans. It seems,
therefore, that the difficulties enumerated above would,
at best, force a piecemeal approach to CSP's in general
with each plan having to be laboriously analyzed from
scratch. However, in 1971, R. Brugger [Ref. 7.3]
presented a unified and simplified scheme of deriving the
mean of FI(∞) in the Job Shop case and of FI(N) in the
Arbitrary Entry case for sampling plans of this class
(with obvious extensions to still more general classes).
It is his systematic treatment that stimulated the
approach given in the present paper.

In  this  paper,  the  drawbacks  to  using  the  dif- 
ference equation,  Z-transform  approach  are  partially 
(and  in  some  respectB  completely)  sidestepped  by  the 
introduction  of  Semi  Markov  Chain  (SMC)  models.  In 
these  models,  each  phase  is  considered  to  be  a SMC  state; 
the  time  from  entrance  to  and  exit  from  a given  phase  to 
another  is  treated  as  the  time-of -sojourn  in  that  state 
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until  that  particular  transition  first  occurs.  Further- 
more, the  probability  density  functions  (p.d.f.'s)  of 
these  sojourn  times  are  obtained  in  essentially  two  dif- 
ferent but  equivalent  ways;  formulas  for  the  first 
entrance  probability  functions  are  derived  either  from  an 
absorbing  MC  or  an  absorbing  SMC  setup  in  which  the  given 
MC  states  of  a phase  are  regarded  either  as  transient  MC 
or  transient  SMC  states  respectively  and  in  both  approaches, 
the  possible  exit  phases  are  regarded  as  absorptive  states 
(all  other  remaining  phases  being  deleted) . Because  the 
original  MC  mode]  of  any  plan  in  this  class  is  time 
homogeneous,  irreducible,  and  finite,  the  SMC  model  con- 
structed from  it  is  also  — a circumstance  which  eventually 
leads  to  a finite  system  of  easily  solved,  linear  convolu- 
tion equations  for  the  desired  probabilities  and  quantities 
derived  from  them. 

The SMC method of obtaining the p.d.f.'s for the
canonical phases by splitting them into new one-MC-state
subphases, obtaining the corresponding p.d.f.'s, and then
reassembling the pieces at the end is really just a
variation of the basic idea of constructing a SMC from a MC.
Elaborating on this observation, similar departures from
the prescription "canonical phase = SMC state" are also
considered to aid in the analysis of CSP's: combining
two or more canonical phases into a new (super) phase,
splitting a canonical phase into two or more new (sub)
phases, and/or altering the p.d.f.'s of the phases by
the introduction of self-transitions. Thus the word
"canonical" (or "basic") should be considered only as
a handy reference term. This added flexibility broadens
the applicability of the constructive technique to
include MC models in general; for example,
weapon-effectiveness and acquisition-of-target models, skip lot
sampling procedures [Ref. 7.5], or CSP's with either
different types of phases than those considered here
or with two or more of the same type which are, however,
described by different parametric values. Moreover,
as will be seen, the SMC that results from any coalescing
of phases is a filtered SMC of the original. Hence,
using the variant techniques suggested by the SMC method,
we now can associate with or construct from a MC model
not just one SMC, but rather a partially ordered set of
SMC's with order relation: SMC1 < SMC2 iff SMC1 is a
filtration of SMC2.


Some troubles do arise in two situations due to the
non Markovian nature of a SMC and in a third setting due
to the relationship between a Markov Renewal Process (MRP)
and a SMC. The first difficulty occurs in the derivation
of the second moment of FI(N); the troublesome point is
resolved by the introduction of the concept of filtration
(in this case, phase segmentation). The second problem
lies in the meaning of stationarity for a SMC and arises
specifically here in the treatment of the Arbitrary Entry
case. This latter complication is overcome by the
introduction of delayed p.d.f.'s, which are equivalent to the
delayed p.d.f.'s in Markov Renewal Theory. The third
difficulty involves the proper handling of self-transitions:
a MRP will record such jumps while the induced SMC will not.
If a probabilistic interpretation is to be maintained, this
snag is handled by treating the MRP as the primary object,
the SMC as secondary.

1.1 Notational idiosyncrasies. Throughout the rest of this
paper, certain notational idiosyncrasies are observed.
(a) In dealing with transfer functions like Q(z) say, many
times the explicit argument is deleted, especially in
complex formulas. (b) Many of the proofs alternate between
the convolutional or sequential notation and the equivalent
transfer functional one in order to provide some variety;
the transfer or "hat" notation greatly predominates however
because of greater ease in manipulation. (c) CSP is
sometimes used synonymously with MC model, sometimes not; the
context makes the usage clear. (d) Since the MC states are,
by tradition, symbolized by upper case letters, the phases
by lower case ones, a minor inconsistency arises whenever
any of the canonical phases are split; for example, uls
can be split into its component MC states SI (Sampling
Inspected) and SN (Sampling Noninspected) which in turn
can be looked upon as (variant) phases. For simplicity,
this "dual" system is retained here; for instance, when
necessary, we shall talk about the phase SN rather than
the phase sn.
2.0 PRELIMINARY DEFINITIONS AND RESULTS.

2.1 Z-transform. Throughout this paper, the Z-transform
method will be used exclusively; it is however formally
equivalent to the generating function method, the
transformation w = 1/z being the bridge between the two
techniques. Below, NN is the set of natural numbers and RR
is the set of real numbers.

Definition 1. Given a sequence {a(j)} considered as
a function

\[ a: NN \to RR, \]

its Z-transform is

\[ \hat{a}(z) = \sum_{j=0}^{\infty} a(j)\, z^{-j}. \]

To retrieve the sequence a(·), contour integration
is used:

\[ a(n) = \frac{1}{2\pi i} \oint_{\Gamma} \hat{a}(z)\, z^{n-1}\, dz, \]

where Γ is the path |z| = R(a) + ε; in any subsequent use
of this formula, the following abbreviation will be used:


In definition 1, â(z) is a function of a complex
variable z, analytic in a neighborhood of infinity; i.e.,
â(z) is analytic for |z| > R(a) whose size, in turn, depends
on the growth properties of a(j) as j → ∞.

We next define two heavily used standard sequences,
the operation of convolution between two arbitrary ones,
and the Z-transform of these results.
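Definition 2. The Dirac sequence at k,

\[ \delta_k: NN \to \{0,1\} \] for k ∈ NN, is defined via:

\[ \delta_k(j) = \begin{cases} 1, & \text{for } j = k \\ 0, & \text{otherwise.} \end{cases} \]


Definition 3. The Heaviside sequence at k,

\[ H_k: NN \to \{0,1\} \] for k ∈ NN, is defined via:

\[ H_k(j) = \begin{cases} 1, & \text{for } j \ge k \\ 0, & \text{otherwise.} \end{cases} \]


Proposition 1.

\[ \hat{\delta}_k(z) = \frac{1}{z^k} \quad \text{and} \quad \hat{H}_k(z) = \left(\frac{1}{z^k}\right)\left(\frac{z}{z-1}\right), \quad |z| > 1. \]

Proof. Clear from the definitions.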


Definition 4. Given two sequences a(·) and b(·),
their convolution a*b(·) is a new sequence given by

\[ (a*b)(n) = \sum_{k=0}^{n} a(n-k)\, b(k). \]


Proposition 2. Letting RM = Max(R(a), R(b)),

\[ \widehat{a*b}(z) = \hat{a}(z)\,\hat{b}(z); \quad \widehat{a+b}(z) = \hat{a}(z) + \hat{b}(z); \quad \text{and} \quad \widehat{ra}(z) = r\,\hat{a}(z) \]

for |z| > RM where appropriate and r ∈ RR.

Proof. The preceding definitions and the Cauchy
product for the multiplication of two power series.

We next state a useful property of the Z-transform.


Theorem 1. (End point property) If a(·) is a bounded
sequence, then â(z) converges at least for |z| = R > 1 and

\[ \lim_{z \to 1} \frac{z-1}{z}\, \hat{a}(z) = a(\infty). \]


Proof. See reference 7.2, chp. 11.
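
The convolution rule of Proposition 2 and the end point property can be illustrated numerically. The following is a minimal sketch, not part of the paper: the function names are hypothetical, sequences are finite Python lists (so an infinite sequence has to be truncated), and z is taken real and greater than one.

```python
def z_transform(seq, z):
    """Evaluate sum_j a(j) * z**(-j) for a finite (or truncated) sequence."""
    return sum(a * z ** (-j) for j, a in enumerate(seq))


def convolve(a, b):
    """Discrete convolution (a*b)(n) = sum_k a(n-k) b(k) of two finite sequences."""
    out = [0.0] * (len(a) + len(b) - 1)
    for n, x in enumerate(a):
        for k, y in enumerate(b):
            out[n + k] += x * y
    return out


def end_point_estimate(seq, z):
    """(z - 1)/z times the Z-transform; approaches a(infinity) as z decreases to 1."""
    return (z - 1) / z * z_transform(seq, z)
```

For instance, with a(j) = 1 - 0.5**j truncated to 40000 terms, end_point_estimate(seq, 1.001) lies close to the limiting value a(∞) = 1, and moving z nearer to 1 (with a longer truncation) brings it closer still.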

2.2 Semi Markov Chains. We next define the concept of a
SMC. Given a finite set S, numbered from 1 through r, an
outcome space Ω (of sample paths), and a family of random
variables {X(t)}, t ∈ NN, from Ω to S, we have

Definition 5. X(·) is a finite state, time homogeneous
Semi Markov Chain iff for each i ∈ S, there exists a family
of functions from NN to [0,1],

\[ \{Q_{i,j}(t)\}, \quad j \in S_i \subset S, \]

such that

1. \( 0 \le Q_{i,j}(t), \quad j \in S_i \)

2. \( \sum_j H_0 * Q_{i,j}(\infty) = 1, \quad j \in S_i \)

3. \( P[X(t) = j \mid X(t') = i,\ 0 \le t' < t] = Q_{i,j}(t) \)

4. \( Q_{i,j}(t', t+t') = Q_{i,j}(0,t) = Q_{i,j}(t). \)

The following interpretations can be given to the four
steps in definition 5. A SMC can be looked upon as a MC in
which transitions take place at random times; for i ∈ S,
{Q_{i,j}(t)} is just the family of defective p.d.f.'s of the
time to transition to some possible exit state, depending
on i, and starting initially from i; i.e., the functions
are just the time-of-sojourn p.d.f.'s. Step 2. of the
definition guarantees that a transition will occur with
probability one. Step 3. is just a more symbolic
restatement of the interpretation for the functions Q_{i,j}, explicitly
linking them to X(·). Finally, step 4. is the time
homogeneity criterion.
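
The reading of a SMC as a MC whose transitions occur at random times can be made concrete with a small path sampler. This is a hedged sketch under assumptions that are not in the paper: the name sample_smc_path is hypothetical, and the kernel for each state i is stored as a list of (next state, sojourn time, probability) triples, i.e. the nonzero values of Q_{i,j}(t), with probabilities summing to one and sojourn times of at least one time unit.

```python
import random


def sample_smc_path(Q, start, horizon, seed=0):
    """Sample X(0), ..., X(horizon-1) of a semi-Markov chain.

    Q[i] lists (next_state, sojourn_time, probability) triples for state i;
    the exit state and the time spent in i are drawn jointly from this list.
    """
    rng = random.Random(seed)
    path, state = [], start
    while len(path) < horizon:
        u, acc = rng.random(), 0.0
        for nxt, sojourn, prob in Q[state]:
            acc += prob
            if u < acc:
                break
        path.extend([state] * sojourn)   # remain in `state` for the sampled sojourn
        state = nxt                      # then jump to the sampled exit state
    return path[:horizon]
```

As a usage example, a two-state kernel that always stays 2 units in state 0 and 3 units in state 1 produces the periodic path 0, 0, 1, 1, 1, 0, 0, 1, 1, 1, ...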

In preparation for the next major theorem, we list, for
convenience, some abbreviations in definition 6. To
emphasize some of the shorthand notations, one should note
that \( H_0 * a(t) = \sum_j a(j) \), j = 0 to t, and \( (\delta_0 * a)(t) = a(t) \).

Definition  6 . 


a. 

b. 

c. 

d. 

e. 


Ho*Qi,k(t)  " 

IQi  k(t>  - Q±(ti  , k £ S£ 
k 

EAi,k(t)  = Ai(t) ' k € si 
k 

H0*  U0-Q1)  (t)  - . 


Some comments on definition 6 follow. In a., A_{i,k}(t)
is the (defective) distribution function (d.f.) for
the transition i → k. Q_i in b. or A_i in c. is the (non
defective) p.d.f. or d.f. respectively of a transition
from i. In d., J_{i,k}(t) is the (defective) d.f. of no
transition from i to k and finally, summing this quantity
over all possible exit states from i, we get J_i which is
the d.f. of no transition from i. In the future, for
convenience, we let Q_{i,k}(t) = 0 if k ∉ S_i, thus eliminating
the need for additional notation. Having definition 6,
we can now state

Theorem 2. (Backward equations) If we define

\[ P_{i,j}(t) = P[X(t) = j \mid X(0) = i], \]

we then have

\[ P_{i,j}(t) = \delta_{i,j}\, J_i(t) + \sum_k Q_{i,k} * P_{k,j}(t), \quad k \in S, \]

where \( \delta_{i,j} \) is the ordinary Kronecker δ.

Proof. Time homogeneity and conditioning on the
time of first transition starting from i; also see
[7.9 and 7.11].

Associated with any SMC is its embedded MC; we define
this in

Definition 7. Let W_n be the time for the n-th
transition; let

\[ Y(n) = X(W_n). \]

Then Y(·) is the Embedded MC associated with X(·).

Clearly,

\[ [A_{i,j}(\infty)] \]

is the transitional matrix for Y(·); it is time homogeneous
since X(·) is.


Letting F_{i,k}(t) be the first entrance probability of i
into k, which exists since transitions take place at Markov
points or epochs (i.e., the W_n's), conditioning on the first
entrance, and using the Z-transform, we have

Proposition 3.

a. \( P_{j,j}(t) = F_{j,j} * P_{j,j}(t) + J_j(t). \)

a'. \( \hat{F}_{j,j} = 1 - \hat{J}_j / \hat{P}_{j,j} \) and \( \hat{P}_{j,j} = \hat{J}_j / (1 - \hat{F}_{j,j}). \)

b. \( P_{j,k}(t) = F_{j,k} * P_{k,k}(t). \)

b'. \( \hat{F}_{j,k} = \hat{P}_{j,k} / \hat{P}_{k,k} \) and \( \hat{P}_{j,k} = \hat{F}_{j,k}\, \hat{P}_{k,k}. \)

Letting W_n(k) be the time of occurrence (waiting time)
of the n-th entrance into k by X(·), we have

Definition 8.

\[ N_k(t) = \mathrm{Max}\,\{ n \mid W_n(k) \le t \} \]

\[ E_j[N_k(t)] = E[N_k(t) \mid X(0) = j] = R_{j,k}(t) \]

\[ N(t) = \sum_k N_k(t), \quad k \in S \]

\[ M(t) = (N_k(t)), \quad k \in S. \]

Proposition 4.

\[ P_{k,k}(t) = R_{k,k} * (\delta_0 - Q_k)(t) \]

\[ P_{j,k}(t) = R_{j,k} * (\delta_0 - Q_k)(t). \]

Proof. 

«•  * H0*Fk,kfn)(t)  - 1 + f, 

n=0  n=l 

* 1 + 2 PktNk(t)>n] 

n=l 

- 1 + ? nPk£Nk(t)=m] 

111=1 

- Ek[Nk(t)I 

b,  taking  Z-transforms , we  have 

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5P- 


JF*» 


Similarly,  we  also  have 

Rjk  = Fjk*Rkk  or  Rj*  * Fjk  *kk' 


c.  From  b,  and  Prop  3a*  and  3b*, 


IIo<1-Qk> 


= Rkk(1"Qk) 


pjk  “ FjkPkk 


FjkRkk(1“Qk) 


Rjk(l-Qk). 


As  in  a finite  MC,  we  can  define  recurrent  state, 
communication  of  states,  irreducibility,  and  periodicity. 
With  these  topics  in  mind,  we  state 

Theorem  3 . A (finite)  SMC  is  irreducible  iff  its 
embedded  MC  is.  A SMC  is  aperiodic  iff  there  exists  j — Si 
such  that  support  (Qij)  £ (it),  t = 1 to  • provided  that 
the  SMC  is  irreducible. 

Proof.  (a)  First  statement  see  [Ref  7.6,  Chp  5], 

(b)  Second  statement  see  [Ref  7.7,  Chp  2], 

The  proofs  are  straight  forward  but  lengthy. 

If i ∉ S_i (i.e., Q_{i,i} = 0) for all i, then
an irreducible SMC induces a uniquely defined MRP and
conversely. The MRP can be taken to be {(Y_n, τ), τ is the
time spent in Y_n since the last transition} [Ref 7.6,
Chp 7]; in older terminology [Refs 7.11, 7.12], N(t),
definition 8, is defined as the associated MRP since
{(k, W_n(k)), k ∈ S} can be looked upon as a multiple
Markov renewal process. Conversely, the MRP induces
the SMC via: X(t) = Y_{N(t)}.


Closing the section, we give the basic theorem on
irreducible SMC's.


Theorem 4. If μ_s is the long-run mean time-of-sojourn
in state s, then, given that the SMC is irreducible and
finite with e = (e_1, ..., e_r) as the corresponding
eigenvector of the embedded MC matrix with eigenvalue one, we
have

a.  Lim  P4  . (t)  = P. (-) 
t-»  3 


= J--3. 


b.  Lim  ,(z)  - P.  (-} 

Z J 


c.  An  ergodic  theorem  holds: 


if  F is  a functional,  then 


where 


I F (X  (t) ) -*-E  [F]  [a.e.  ] , 


E [F]  = IF  (s)  P ) 
a s s 


■ zF(s)og. 


Proof,  b.  is  just  the  end-pt.  property  of  the 
Z-transform.  a.  and  c.  follow  from  some  straight-forward 
renewal-theoretical  arguments  found  in  [Ref  7.6,  Chps  7,  8] 


2.3 Sampling Plan Phases. In the following descriptions,
the box diagram for each phase is given first followed by
the MC description; in passing from the former to the latter,
a constant probability of defective, p, is
assumed. Furthermore, practically speaking, upon finding a
defective, one either discards it or, less realistically,
replaces it with a non-defective unit. In Figures 1 through
4, q = 1 - p, f = sampling frequency, v = 1 - f, and the
transitional probabilities are written beside the corresponding
arrows.

Upon entering the screening phase (abbr. sc), inspect
the production units at 100% until I consecutive units are
defect free; then exit.


Figure 1

MC Model of Screening Phase.

one entrance and one exit.
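
The time-of-sojourn p.d.f. of the screening phase can be computed directly from its absorbing MC by propagating probability mass one unit at a time. The sketch below is an illustration under assumptions not stated in the paper: the function name is hypothetical, the phase is entered with no defect-free units counted yet, and every defective unit resets the count to zero. The mean (1 - q^I)/(p q^I) used as a check is the standard result for the waiting time to I consecutive successes, not a formula quoted from this paper.

```python
def screening_sojourn_pdf(p, I, t_max):
    """Return pdf[t] = P(screening phase is left exactly at unit t), t = 0..t_max.

    State h (0 <= h < I) means h consecutive defect-free units have been seen.
    """
    q = 1.0 - p
    state = [0.0] * I
    state[0] = 1.0                       # assumed entrance: run length zero
    pdf = [0.0] * (t_max + 1)
    for t in range(1, t_max + 1):
        new = [0.0] * I
        for h, mass in enumerate(state):
            new[0] += mass * p           # defective unit: run length resets
            if h + 1 == I:
                pdf[t] += mass * q       # I-th consecutive good unit: exit
            else:
                new[h + 1] += mass * q   # good unit: run length grows by one
        state = new
    return pdf
```

With p = 0.1 and I = 3 the earliest possible exit is at t = 3, with probability q^3 = 0.729, and the truncated p.d.f. sums to essentially one for a large enough t_max.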


Upon  entering  the  unlimited  sampling  phase  (abbr.  uls), 
sample  at  random  with  frequency  f until  a defect  is  found 
(during  inspection);  then  exit. 
Figure 2

MC Model of Unlimited Sampling Phase.

SN = Noninspection State
SI = Inspection State


Upon entering a limited sampling phase (abbr. ls),
sample at frequency f until, condition 1, k units are
sequentially found to be defect free or, condition 2,
a defect is found before condition 1 is satisfied; then
exit to the condition-dependent next phase.


Figure 3

MC Model of Limited Sampling Phase.

one entrance and two exits


Upon entering the checking phase (abbr. ck), inspect
at 100% the next m units, discarding (or replacing) all
defective units found; exit one way if the m units are
all defect free, or another, different way if one or
more defects are found.


Figure 4

MC Model of Checking Phase

one entrance and two exits.

[Diagram: chain of states ending in C(m-2), C(m-1), with arrows labelled q, p, and 1.]
1 


2.4 Sampling Plans. Having defined the sampling phases,
it is now an easy matter to describe the two most
practical ones, CSP-1 and generalized CSP-2, as well as
one which contains all four phases, generalized CSP-3.
The diagrams in Figure 5 are self-explanatory; also, as
indicated by the diagrams, the terms CSP and MC model
will be used interchangeably unless otherwise stated; the
use of the word "generalized" is necessitated by standard
usage which requires k = I for the ls phase and m = 4 for
the ck phase.


Figure 5
Sampling Plans


CSP-1 [I, f, p]


(Generalized)
CSP-2 [I, k, f, p]


(Generalized)
CSP-3 [I, k, m, f, p]

3.0 SAMPLING PLAN PHASES.


3.1 Phases as SMC states. In Theorems one through four
below, two essentially different but equivalent methods
of proof are used; the MC method and the SMC method.
Before launching into a description of these two approaches,
we make

Definition 1. A phase is completely ordered iff its
states are well-ordered by the phase regime from entrance
to exit; it is quasi ordered iff its states are totally
ordered by the phase regime from entrance to exit.

In the MC method, for a given phase, an absorbing MC
is constructed whose transient states or absorptive states
correspond to the phase MC states or exit phases
respectively. Using an initial probability vector whose transient
components are equal to the individual phase MC state
entrance probabilities conditioned by the event of initial
phase entrance, the formulas for the first entrance
probability functions into the various absorptive states are
then derived. Specifically, for a given absorptive state,
the first entrance p.d.f. into this state starting from
each of the transient states is obtained; then a weighted
sum of these functions, each weighted by its initial
entrance probability, is taken. The result is the desired
p.d.f. for this particular exit phase.

In the SMC method, a given phase is broken up into
its constituent MC states by treating any state with
self-transitions or no self-transitions as a non-degenerate or
degenerate MC (noncanonical) phase respectively. Proceeding
according to the MC method, the appropriate p.d.f.'s for
each of these (noncanonical) MC phases are then obtained
thereby constructing from each such state a "mini" absorbing
SMC whose absorptive SMC states are equal in number to the
possible exit MC states, exclusive of the state itself
(i.e., no self-transitions are allowed in the SMC). These
"mini" absorptive SMC's are then amalgamated into a composite
absorbing SMC whose transient states are now the
corresponding "mini" transient SMC states and whose absorptive states
are, once again, the relevant exit phases. Finally, since
a first entrance occurs at an epoch in a SMC, one can
proceed to mimic the MC method to derive the first entrance
probabilities. In particular, for a canonical MC phase which
is either completely ordered or can be subdivided into
subphases which are (e.g., the ls phase), this method can
proceed inductively, the absorbing state or subphase at
step h being the (h + 1)st state or subphase respectively.

Thus, in the end, with either method, we have
constructed from a given phase an absorbing SMC with one
transient state and absorptive states equal in number to the
possible exit phases. Moreover, though the setup given in
Chapter two for an absorbing SMC can be formally used for
the amalgamation in the SMC method, the proofs below which
use this method will be given more constructively and,
hopefully, more intuitively. Nonetheless, the absorbing
SMC apparatus will always lie in the background.
Furthermore, outside of the context of any irreducible SMC, we
shall hereafter refer to a phase with its p.d.f.'s either
as a potential SMC state, for eventual inclusion in a CSP
(irreducible SMC), or as a transient SMC state in the
constructive sense of the SMC method; both viewpoints are
mathematically equivalent, the nuances different.

In Theorems one through four below, A(·) will always
stand for an absorptive state; for instance, if a phase
has only one possible exit, the symbol "A" alone will be
used; if two or more exits are possible, the symbols "A_1",
"A_2", ..., or "A(1)", "A(2)", ... will then be used.

Theorem 1. The screening phase is a potential SMC
state with p.d.f. given as:

$$\hat{Q}_{sc,A}(z) = \frac{q^{I}(z-q)}{z^{I}(z-1)+\gamma}, \qquad \gamma = pq^{I}.$$


Proof. (MC method)
a. Absorbing MC is given by:

            H0   H1   H2   ...  H(I-1)   A
  H0        p    q    0    ...    0      0
  H1        p    0    q    ...    0      0
  ...
  H(I-2)    p    0    0    ...    q      0
  H(I-1)    p    0    0    ...    0      q
  A         0    0    0    ...    0      1

b. Initial probability vector = (1, 0, 0, ..., 0)


c. From the Chapman-Kolmogorov (C-K) equations, we
get, from a. and b., the following system (letting Hj = j):

$$P^{n}_{j,A} = pP^{n-1}_{0,A} + qP^{n-1}_{j+1,A},$$


p?-i,a  - pps;i  + *‘1 


since 


PA,  A - "M  * Vk) 


P£,A  ’ V*’ 


d. Because of b., we want to obtain $f^{n}_{0,A}$,
which is just $Q_{sc,A}(n)$.

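e. But

$$P^{n}_{0,A} = (f_{0,A} * P_{A,A})(n),$$

so that

$$\hat{P}_{0,A} = \hat{f}_{0,A}\,\hat{P}_{A,A} = \hat{f}_{0,A}\,\hat{H}_0.$$

Thus $\hat{f}_{0,A} = (\hat{H}_0)^{-1}\hat{P}_{0,A}$.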


f.  From  c.  we  obtain  by  substitution: 
1-1 


,n 


0,  A 


or 


j-0 

1-1 


J Fa  « tH 

Va  - j'o'”  + q 


0 

IT 


Simplifying  and  using  geometric  series  summation,  we  finally 
have 

S0,A-¥fe  • 

z (z-1)+y 

g. Thus from f., e., and d., we have

$$\hat{Q}_{sc,A}(z) = \frac{q^{I}(z-q)}{z^{I}(z-1)+\gamma}.$$
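As a numerical sanity check on the screening-phase p.d.f. (not part of the original proof), the following sketch propagates an absorbing chain forward in time, collects the first-entrance probabilities into A, and compares their truncated Z-transform with the closed form q^I (z-q) / (z^I (z-1) + p q^I). The chain structure assumed here is: a defect (probability p) returns to H0, a good item (probability q) advances Hj to H(j+1), and a good item in H(I-1) exits to A. The function names and the test point z = 1.5 are illustrative assumptions.

```python
def screening_first_entrance(p, I, n_max):
    """First-entrance probabilities f(n), n = 0..n_max, into A starting from H0."""
    q = 1.0 - p
    state = [0.0] * I          # probability mass on H0 .. H(I-1)
    state[0] = 1.0             # initial probability vector (1, 0, ..., 0)
    f = [0.0] * (n_max + 1)
    for n in range(1, n_max + 1):
        f[n] = q * state[I - 1]            # good item in H(I-1): exit to A
        nxt = [0.0] * I
        nxt[0] = p * sum(state)            # any defect returns to H0
        for j in range(I - 1):
            nxt[j + 1] = q * state[j]      # good item advances Hj -> H(j+1)
        state = nxt
    return f


def z_transform(seq, z):
    """Truncated Z-transform: sum over n of seq[n] * z**(-n)."""
    return sum(a * z ** (-n) for n, a in enumerate(seq))


def q_sc_closed_form(p, I, z):
    """Closed form q^I (z - q) / (z^I (z - 1) + gamma), gamma = p q^I."""
    q = 1.0 - p
    return q ** I * (z - q) / (z ** I * (z - 1.0) + p * q ** I)


if __name__ == "__main__":
    p, I, z = 0.1, 4, 1.5
    f = screening_first_entrance(p, I, 400)
    print(z_transform(f, z), q_sc_closed_form(p, I, z))
```

The two printed numbers should agree to within the truncation and rounding error of the series.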


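Second Proof. (SMC method).

a'. Since the sc phase is completely ordered, we can
proceed by induction. For I = 1, the one-state diagram for H0
yields

$$\hat{Q}_{0,A} = \frac{q}{z-p} = \frac{q(z-q)}{(z-p)(z-q)} = \frac{q(z-q)}{z(z-1)+\gamma_1}.$$

b'. For $sc_j$ with exit to H(j+1),

$$\hat{Q}_{sc_j,A(j+1)}(z) = \frac{q^{j}(z-q)}{z^{j}(z-1)+\gamma_j}, \qquad \gamma_j = pq^{j},$$

by induction. Moreover,

$$\hat{Q}_{j+1,A(sc_j)}(z) = \frac{p}{z}, \qquad \hat{Q}_{j+1,A}(z) = \frac{q}{z}, \qquad \text{where } (j+1) = H(j+1).$$

c'. Amalgamate (j+1) and $sc_j$. Thus

$$\hat{Q}_{sc_{j+1},A} = \Big\{\sum_{n=0}^{\infty}\big(\hat{Q}_{sc_j,(j+1)}\hat{Q}_{(j+1),sc_j}\big)^{n}\Big\}\hat{Q}_{sc_j,(j+1)}\hat{Q}_{(j+1),A} = \frac{\hat{Q}_{sc_j,(j+1)}\hat{Q}_{(j+1),A}}{1-\hat{Q}_{sc_j,(j+1)}\hat{Q}_{(j+1),sc_j}}$$

$$= \frac{q^{j+1}(z-q)}{z^{j+1}(z-1)+pq^{j}z-pq^{j}z+pq^{j+1}} = \frac{q^{j+1}(z-q)}{z^{j+1}(z-1)+\gamma_{j+1}}.$$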


Third Proof. (SMC method - no induction) This third
proof is given to further elucidate the SMC method.

a''. Mini states: all are degenerate except
for 0. Working with the corresponding mini-absorbing
SMC's, we have

$$\hat{Q}_{0,A(1)}(z) = \frac{q}{z-p},$$

$$\hat{Q}_{j,A(j+1)}(z) = \frac{q}{z}, \qquad \hat{Q}_{j,A(0)}(z) = \frac{p}{z}, \qquad 0<j\le(I-1),$$

where A(I) = A.
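which implies

$$\hat{P}_{SN,A} = \frac{f}{z-u}\,\hat{P}_{SI,A},$$

$$\hat{P}_{SI,A} = \frac{uq}{z-fq}\,\hat{P}_{SN,A} + \frac{p}{z-fq}\,\hat{H}_0.$$

Since $\hat{P}_{SN,A} = \hat{f}_{SN,A}\hat{H}_0$ and $\hat{P}_{SI,A} = \hat{f}_{SI,A}\hat{H}_0$,
we have upon simplifying:

$$\hat{f}_{SN,A} = \frac{\delta}{z(z-\beta)} \quad\text{and}\quad \hat{f}_{SI,A} = \frac{p(z-u)}{z(z-\beta)}.$$

d. From b., we want to obtain

$$(u)f^{n}_{SN,A} + (f)f^{n}_{SI,A} = Q_{uls,A}(n).$$

e. Transforming d. and using the last two formulas
in c., we have

$$u\Big(\frac{\delta}{z(z-\beta)}\Big) + f\Big(\frac{p(z-u)}{z(z-\beta)}\Big) = \frac{\delta}{z-\beta}.$$

f. Thus $\hat{Q}_{uls,A}(z) = \dfrac{\delta}{z-\beta}$ from e. and d.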


Second Proof. (SMC method)

a'. $\hat{Q}_{SN,A(SI)}(z) = \dfrac{f}{z-u}$, $\quad\hat{Q}_{SI,A(SN)}(z) = \dfrac{uq}{z-fq}$, $\quad\hat{Q}_{SI,A}(z) = \dfrac{p}{z-fq}$.

Thus we have two non-degenerate mini SMC states and two
mini absorbing SMC's with the following embedded MC's:

  SMC(SN):  SN -> A(SI)

  SMC(SI):  SI -> A(SN),  SI -> A


b'. Now once again assemble the two mini absorbing
SMC's into one aggregate absorbing SMC. The result is an
absorbing SMC with two transient states (SN; $\hat{Q}_{SN,SI}(z)$) and
(SI; $\hat{Q}_{SI,SN}(z)$, $\hat{Q}_{SI,A}(z)$), one absorptive state (A; $\hat{Q}_{A,A}(z)$),
and an embedded MC given by

          SI    SN           A
  SI      0     uq/(1-fq)    p/(1-fq)
  SN      1     0            0
  A       0     0            1


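c'. Hence, we now want:

$$(u)\hat{F}_{SN,A}(z) + (f)\hat{F}_{SI,A}(z) = \hat{Q}_{uls,A}(z)$$

by d. We can write down expressions for the F's directly:

$$\hat{F}_{SN,A} = \hat{Q}_{SN,SI}\Big\{\sum_{n=0}^{\infty}(\hat{Q}_{SI,SN}\hat{Q}_{SN,SI})^{n}\Big\}\hat{Q}_{SI,A} = \frac{\hat{Q}_{SN,SI}\hat{Q}_{SI,A}}{1-\hat{Q}_{SI,SN}\hat{Q}_{SN,SI}},$$

$$\hat{F}_{SI,A} = \Big\{\sum_{n=0}^{\infty}(\hat{Q}_{SI,SN}\hat{Q}_{SN,SI})^{n}\Big\}\hat{Q}_{SI,A} = \frac{\hat{Q}_{SI,A}}{1-\hat{Q}_{SI,SN}\hat{Q}_{SN,SI}}.$$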


d'. From c'. and simplifying,

$$\hat{Q}_{uls,A}(z) = \frac{(u\hat{Q}_{SN,SI}+f)\,\hat{Q}_{SI,A}}{1-\hat{Q}_{SN,SI}\hat{Q}_{SI,SN}} = \frac{\delta}{z-\beta}.$$
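The amalgamation of the two mini SMC's for the uls phase can be checked numerically. The sketch below evaluates (u Q_SN,SI + f) Q_SI,A / (1 - Q_SN,SI Q_SI,SN) at a few points z and compares it with a geometric transform delta / (z - beta). It assumes Q_SN,SI = f/(z-u), Q_SI,SN = uq/(z-fq), Q_SI,A = p/(z-fq), and takes delta = fp, beta = 1 - fp; these values are an inference from the algebra and from the mean 1/(fp) quoted for uls, not a quotation of the theorem statement.

```python
def q_uls_amalgamated(p, f, z):
    """(u*Q_SN,SI + f) * Q_SI,A / (1 - Q_SN,SI * Q_SI,SN) evaluated at z."""
    q, u = 1.0 - p, 1.0 - f
    q_sn_si = f / (z - u)            # mini SMC for SN: exit to SI
    q_si_sn = u * q / (z - f * q)    # mini SMC for SI: exit to SN
    q_si_a = p / (z - f * q)         # mini SMC for SI: exit to A
    return (u * q_sn_si + f) * q_si_a / (1.0 - q_sn_si * q_si_sn)


def q_uls_closed_form(p, f, z):
    """Geometric transform delta / (z - beta); delta = f*p is an assumption."""
    delta = f * p
    beta = 1.0 - delta
    return delta / (z - beta)


if __name__ == "__main__":
    for z in (1.0, 1.2, 2.5, 1.3 + 0.4j):
        print(z, q_uls_amalgamated(0.1, 0.2, z), q_uls_closed_form(0.1, 0.2, z))
```

Agreement at z = 1 (both sides equal to 1) reflects that exit from uls is certain.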


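Theorem 3. The limited sampling phase is a potential
SMC state with p.d.f.'s given by:

$$\hat{Q}_{ls,A(1)}(z) = \frac{\delta}{z-\beta}\Big(1-\Big(\frac{fq}{z-u}\Big)^{k}\Big),$$

$$\hat{Q}_{ls,A(2)}(z) = \Big(\frac{fq}{z-u}\Big)^{k}.$$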

First Proof. (MC method)

a. Ordering the states of ls as: $SN_0$, $SI_0$, ...,
$SN_{k-1}$, $SI_{k-1}$, $A_1$, and $A_2$; we have an initial probability
vector v = (u, f, 0, 0, ...); i.e., $v_{SN_0}$ = u and $v_{SI_0}$ = f;
the matrix corresponding to the absorbing MC can be easily
written down if one desires.

b. For convenience, we combine $SN_0$ and $SI_0$ into
$SN_0 \vee SI_0$ = S. Then, using the C-K equations again, we
have

m ap^— 1 x p^— 1 

S,A(2)  qiS,SI(k-l)  S,A(2) 


<fq)P 


S,SI(k-(j+l)) 


(l<j<k-l) 


+ (f)P 


S,SN (k-j) 


PS,SN(k-j)  " (uq)PS/SI(k-(j+l))  + (u)PS,SN(k-j) 


PS,SNO  “ uPS/SU0+(u)40tn) 


PS,SIO  " fPS,SNO+(f)fiO(n*  * 


c. Letting a = $P_{S,A(2)}$, $b_j$ = $P_{S,SIj}$, and $c_j$ = $P_{S,SNj}$,
then, using the Z-transform in b. and simplifying yields:


a " (A)bk-l 


(l<j<k-l) 


ck-j  “ (z^J>bk-(j+l) 


&bk- 


(j+1) 


/.  bj  - <;|g>jb0  for  l<j<k-l 

/.  •“  *)<^r>k'lbo 


'0  z-u  * 


••  a " ”o(z^u)k  i using  *•'  ^ls,A(2) (z)  " (f^u) 


For  exit  to  A (1) , we  have: 


pn 

fs,a(1) 

which  implies 
PS#A(1) 


d r ^ 4.  p 

Pj«0  s'SIj  S,A(1)PA(1),A(1) 


Pb0 (Z)  ,,  ,fq  vk,  (*-u) 
Tz-TT  (1~(i=u>  )1J=TT 


Again using a. and multiplying through by $(\hat{H}_0)^{-1}$, we have:


Second Proof. (SMC method)

a'. Let

$$A(j+1) = \begin{cases}(j+1), & \text{for } 0\le j\le k-2,\\ A(2), & \text{for } j = k-1.\end{cases}$$

Noting that the 2-state subphases
are all structurally equivalent, in contrast to the sc
phase where H0 differed from Hj, 0 < j ≤ I-1, we could use
the inductive method applied to 2-state subphases;
however, it is easier to use the SMC method directly
applied to 2-state phases, thereby skipping one logical
step in the amalgamation procedure.


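b'. For given k ≥ 1, we have

$$\hat{Q}_{SN_j,SI_j} = \frac{f}{z-u}, \qquad \hat{Q}_{SI_j,A(j+1)} = \frac{q}{z}, \qquad \hat{Q}_{SI_j,A(1)} = \frac{p}{z}.$$

Therefore for subphase j, we have

$$\hat{Q}_{j,A(j+1)} = (u)\hat{Q}_{SN_j,SI_j}\hat{Q}_{SI_j,A(j+1)} + f\hat{Q}_{SI_j,A(j+1)} = \frac{fq}{z-u},$$

$$\hat{Q}_{j,A(1)} = (u)\hat{Q}_{SN_j,SI_j}\hat{Q}_{SI_j,A(1)} + (f)\hat{Q}_{SI_j,A(1)} = \frac{fp}{z-u}.$$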

c'. Hence we obtain mini absorbing SMC's which have
one transient SMC state (two-MC-state) and two absorbing
states for each j. Amalgamating as before, we get k
transient states

$$\{(j;\ \hat{Q}_{j,j+1},\ \hat{Q}_{j,A(1)})\}_{j=0}^{k-2}, \qquad ((k-1);\ \hat{Q}_{(k-1),A(2)},\ \hat{Q}_{(k-1),A(1)})$$

and two absorptive states A(1) and A(2). Its embedded MC
is:

           0   1   2   ...  (k-1)  A(1)  A(2)
  0        0   q   0   ...   0      p     0
  1        0   0   q   ...   0      p     0
  ...
  (k-1)    0   0   0   ...   0      p     q
  A(1)     0   0   0   ...   0      1     0
  A(2)     0   0   0   ...   0      0     1


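d'. The initial probability vector is now
(1, 0, ..., 0). Thus we now want

$$(1)(\hat{F}_{0,A(1)}) = \hat{Q}_{ls,A(1)},$$

$$(1)(\hat{F}_{0,A(2)}) = \hat{Q}_{ls,A(2)}.$$

e'. Once again using a constructive derivation, we
have

$$\hat{F}_{0,A(2)} = (\hat{Q}_{0,1})\cdots(\hat{Q}_{(k-1),A(2)}) = \Big(\frac{fq}{z-u}\Big)^{k}$$

and

$$\hat{F}_{0,A(1)} = (\hat{Q}_{0,A(1)}) + (\hat{Q}_{0,1})(\hat{Q}_{1,A(1)}) + \cdots + (\hat{Q}_{0,1})(\hat{Q}_{1,2})\cdots(\hat{Q}_{k-1,A(1)})$$

$$= \frac{fp}{z-u}\Big\{\sum_{j=0}^{k-1}\Big(\frac{fq}{z-u}\Big)^{j}\Big\} = \frac{fp}{z-u}\cdot\frac{1-\big(\frac{fq}{z-u}\big)^{k}}{1-\frac{fq}{z-u}}$$

$$= \frac{\delta}{z-\beta}\Big(1-\Big(\frac{fq}{z-u}\Big)^{k}\Big).$$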

Theorem 4. The checking phase is a potential SMC
state with p.d.f.'s given by

$$\hat{Q}_{ck,A(2)}(z) = \Big(\frac{q}{z}\Big)^{m},$$

$$\hat{Q}_{ck,A(1)}(z) = \frac{1-q^{m}}{z^{m}}.$$

First Proof. (MC proof)

a. The absorbing MC transitional matrix is easy to
write out from Figure 2.

b. Ordering the states C0, C1, ..., $\overline{C1}$, ..., $A_1$, $A_2$,
we have v = (1, 0, 0, ..., 0) as the initial
probability vector.

c. Proceeding as before, we have

$$P^{n}_{C0,A(2)} = P^{n-1}_{C0,A(2)}P_{A(2),A(2)} + \delta_m(n)\,q^{m},$$

$$P^{n}_{C0,A(1)} = P^{n-1}_{C0,A(1)}P_{A(1),A(1)} + \delta_m(n)\,(1-q^{m}).$$

d. Z-transforming c. and solving we obtain the result.

Second Proof. (SMC method)

a'. Since we neither have a possible re-entry to an
initial state at any step nor a natural segmenting of ck
into ($c_j$, $\bar{c}_j$), we use the SMC method directly.


b“.  The  functions 


* 

Qcj,A(j+l) 

°cj  ,E73+T> 


(s)  - 

(*)  . , 


where  A (m~IJ  ■ A(l)  and  A(m-l)  » A(2),  and 
(0<j<m-2)  ^jrT3TTr(2)  - -i-  ' 

where  A (m-l)  - A(l)  again,  make  up  the  pieces  to  be 
assembled  in  the  usual  way. 

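c'. We want $(1)\hat{F}_{c0,A(1)}$ and $(1)\hat{F}_{c0,A(2)}$; letting
$c_j$ = j and $\bar{c}_j$ = $\bar{j}$, we have

$$\hat{F}_{0,A(2)} = (\hat{Q}_{0,1})(\hat{Q}_{1,2})\cdots(\hat{Q}_{(m-1),A(2)}) = \Big(\frac{q}{z}\Big)^{m},$$

$$\hat{F}_{0,A(1)} = (\hat{Q}_{0,\bar{1}})\cdots(\hat{Q}_{\overline{m-1},A(1)}) + (\hat{Q}_{0,1})(\hat{Q}_{1,\bar{2}})\cdots(\hat{Q}_{\overline{m-1},A(1)}) + \cdots + (\hat{Q}_{0,1})(\hat{Q}_{1,2})\cdots(\hat{Q}_{m-1,A(1)})$$

$$= \frac{p}{z^{m}}\Big(\sum_{j=0}^{m-1}q^{j}\Big) = \frac{1-q^{m}}{z^{m}}.$$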


The last four theorems clearly show that once the
logical structure of a more intuitive MC model is known,
SMC techniques can be vastly superior to the more
pedestrian but, perhaps at first sight, more
straightforward MC techniques.

Theorem 5. (A compendium on the four phases). The
long run mean values for time-of-sojourn and (potential)
transitional probabilities for embedded MC's associated
with the four canonical phases are as follows:

  sc:   $\mu_{sc} = \dfrac{1-q^{I}}{pq^{I}}$;  $q_{sc,A} = 1$.

  uls:  $\mu_{uls} = \dfrac{1}{fp}$;  $q_{uls,A} = 1$.

  ls:   $\mu_{ls} = \mu_{ls,A(1)} + \mu_{ls,A(2)} = \dfrac{1-q^{k}}{fp}$;  $q_{ls,A(1)} = 1-q^{k}$,  $q_{ls,A(2)} = q^{k}$.

  ck:   $\mu_{ck} = \mu_{ck,A(1)} + \mu_{ck,A(2)} = m(1-q^{m}) + mq^{m} = m$;  $q_{ck,A(1)} = 1-q^{m}$,  $q_{ck,A(2)} = q^{m}$.

Proof. If $\{a_n\}$ is a probability sequence, then its
mean is given by

$$(-zD_z\hat{a}(z))\big|_{z=1}$$

where $\hat{a}(z)$ is its Z-transform. Secondly, $H_0*a(\infty) = \hat{a}(z)\big|_{z=1}$.
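The rule "mean = (-z d/dz) of the Z-transform at z = 1" can be tried out numerically. The sketch below approximates the derivative with a central difference and compares the result with closed-form sojourn means for the four phases. The transforms and the closed forms coded here (including delta = fp and beta = 1 - fp) are reconstructions assumed for illustration; the function names are hypothetical.

```python
def mean_from_transform(a_hat, h=1e-6):
    """Approximate (-z d/dz a_hat(z)) at z = 1 by a central difference."""
    return -(a_hat(1.0 + h) - a_hat(1.0 - h)) / (2.0 * h)


def phase_transforms(p, f, I, k, m):
    """Assumed Z-transforms of the total sojourn time in each phase."""
    q, u = 1.0 - p, 1.0 - f
    delta, beta = f * p, 1.0 - f * p
    return {
        "sc": lambda z: q ** I * (z - q) / (z ** I * (z - 1.0) + p * q ** I),
        "uls": lambda z: delta / (z - beta),
        # ls and ck have two exits; adding both exit transforms gives the
        # transform of the total time spent in the phase
        "ls": lambda z: (delta / (z - beta)) * (1.0 - (f * q / (z - u)) ** k)
        + (f * q / (z - u)) ** k,
        "ck": lambda z: (1.0 - q ** m) / z ** m + (q / z) ** m,
    }


def closed_form_means(p, f, I, k, m):
    """Closed-form means to compare against."""
    q = 1.0 - p
    return {
        "sc": (1.0 - q ** I) / (p * q ** I),
        "uls": 1.0 / (f * p),
        "ls": (1.0 - q ** k) / (f * p),
        "ck": float(m),
    }


if __name__ == "__main__":
    args = (0.1, 0.2, 4, 3, 5)
    exact = closed_form_means(*args)
    for name, a_hat in phase_transforms(*args).items():
        print(name, mean_from_transform(a_hat), exact[name])
```

The finite-difference step h trades truncation error against rounding error; the value used is a convenience, not a tuned choice.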
3.2 Self jumps and MRP's. The SMC method suggests the
following considerations. If a given phase is completely
ordered with a possible return to the initial MC state at
each step or is only quasi ordered with possible random
re-entries to each of its MC states at each step, then
corresponding variant, transient SMC states can be
constructed by this method by adding the phase itself as one
of the exit phases. In either case, due to the alterations
in the p.d.f.'s, the resultant SMC state now has
self-transitions, a fact that necessitates defining the MRP
as the primary object, the induced SMC as secondary, in
contrast to the "no self-jump" situation where they are
equivalent. Below, this approach and some of its
implications are examined for the sc and uls phases.


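Theorem 6. If self-transitions are allowed for sc,
we have:

$$\hat{Q}_{\overline{sc},\overline{sc}}(z) = \Big(\frac{p}{z-q}\Big)\Big(1-\Big(\frac{q}{z}\Big)^{I}\Big), \qquad \hat{Q}_{\overline{sc},A}(z) = \Big(\frac{q}{z}\Big)^{I}.$$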


Proof. a. H0 is now treated as a degenerate MC
state and any return to it is considered to be a (self)
transition of sc. Letting j = Hj and $\overline{sc}$ = $A_1$, we
therefore have the following system:


b.  The  system  in  a.  implies,  upon  Z- transforming, 


H0  (z 


♦ **  * — ♦ - 


<fra:,si-i.v 


where  P 


I— 1 , A^ 


t-  V 


*“■  f 0 ,Aj  ’ fi0  '-f->  < «-f">  j > 


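Letting A($\overline{sc}$) = $\overline{sc}$, we have

$$\hat{Q}_{\overline{sc},\overline{sc}}(z) = \Big(\frac{p}{z-q}\Big)\Big(1-\Big(\frac{q}{z}\Big)^{I}\Big).$$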

c.  Pq  a ■ qx*J"xA 
U#A  2 A2#  * 

^>5c,A(z)  " HH*  * 

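Corollary 1. If $\overline{sc}$ denotes the screening phase with
self transitions as defined in Theorem 6, we have

$$\mu_{\overline{sc}} = \frac{1-q^{I}}{p}, \qquad q_{\overline{sc},\overline{sc}} = 1-q^{I}, \qquad q_{\overline{sc},A} = q^{I}.$$

Proof.

$$\mu_{\overline{sc}} = \mu_{\overline{sc},\overline{sc}} + \mu_{\overline{sc},A} = \Big(\frac{1-q^{I}}{p} - Iq^{I}\Big) + Iq^{I} = \frac{1-q^{I}}{p}$$

by differentiation of the Z-transform; rest is trivial.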
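Corollary 2. Letting $\overline{sc}$ be as above,

$$\hat{Q}_{sc,A} = \frac{\hat{Q}_{\overline{sc},A}}{1-\hat{Q}_{\overline{sc},\overline{sc}}}.$$

Proof.

$$\hat{Q}_{sc,A} = \Big(\sum_{n=0}^{\infty}(\hat{Q}_{\overline{sc},\overline{sc}})^{n}\Big)\hat{Q}_{\overline{sc},A}$$

and the fact that $\hat{Q}_{\overline{sc},\overline{sc}}(1) = 1-q^{I} < 1$.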
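The self-jump expansion for the screening phase can be checked against the closed form q^I (z-q) / (z^I (z-1) + p q^I). The sketch below assumes a one-pass self-transition transform (p/(z-q))(1 - (q/z)^I) and a one-pass exit transform (q/z)^I, and compares both the geometric-series sum and its closed quotient with the screening p.d.f. Function names and the test points are illustrative assumptions.

```python
def q_bar_self(p, I, z):
    """One pass through screening that ends in a defect (self-transition)."""
    q = 1.0 - p
    return (p / (z - q)) * (1.0 - (q / z) ** I)


def q_bar_exit(p, I, z):
    """One pass through screening with I consecutive good items (exit to A)."""
    q = 1.0 - p
    return (q / z) ** I


def q_sc_quotient(p, I, z):
    """Exit transform divided by (1 - self-transition transform)."""
    return q_bar_exit(p, I, z) / (1.0 - q_bar_self(p, I, z))


def q_sc_series(p, I, z, terms):
    """Partial sum over j of (self)^j * exit: j defects before the exit."""
    s, e = q_bar_self(p, I, z), q_bar_exit(p, I, z)
    return sum(s ** j * e for j in range(terms))


def q_sc_closed(p, I, z):
    """q^I (z - q) / (z^I (z - 1) + p q^I)."""
    q = 1.0 - p
    return q ** I * (z - q) / (z ** I * (z - 1.0) + p * q ** I)


if __name__ == "__main__":
    p, I = 0.1, 4
    for z in (1.0, 1.5, 3.0):
        print(z, q_sc_quotient(p, I, z), q_sc_series(p, I, z, 400), q_sc_closed(p, I, z))
```

The series converges because the self-transition transform is strictly below 1 at z = 1.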
The expansion in the proof to Corollary 2 above can
be given the following interpretation:

$$(\hat{Q}_{\overline{sc},\overline{sc}})^{j}\,\hat{Q}_{\overline{sc},A}$$

means j defects before transition. However, though the use
of $\overline{sc}$ leads to a probabilistically natural expansion for
$Q_{sc,A}$, it is not necessarily the most practical (except
perhaps for small N) due to an accumulation of factorial
terms; more practical complete formulas useful for all N
can be developed by considering purely analytical
expansions [see Ref 7.1]. Secondly, $\overline{sc}$ throws some light on
the requirement of continued inspection after a defect is
found in the ck phase; ck would be a "one-time" $\overline{sc}$ if
inspection were stopped and a transition made at this
point in contrast to the usual requirement made above.

We now turn to self-jumps for the uls phase. Having
in mind the situation that occurs in the SMC proof of
Theorem 3 for the ls phase, we have

Theorem 7. If uls were to be allowed self-transitions
which mimic any SN-SI block of the ls phase, then

$$\hat{Q}_{\overline{uls},\overline{uls}}(z) = \frac{fq}{z-u} \quad\text{and}\quad \hat{Q}_{\overline{uls},A}(z) = \frac{fp}{z-u}.$$


Proof. Consideration of the following diagram provides
the proof:

[Diagram: SN-SI block with exit A(SI ∨ SN).]


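Corollary 1.

$$\hat{Q}_{uls,A} = \frac{\hat{Q}_{\overline{uls},A}}{1-\hat{Q}_{\overline{uls},\overline{uls}}}.$$

Proof.

$$\hat{Q}_{uls,A} = \Big\{\sum_{n=0}^{\infty}(\hat{Q}_{\overline{uls},\overline{uls}})^{n}\Big\}\hat{Q}_{\overline{uls},A}.$$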
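Corollary 2.

$$\hat{Q}_{ls,A(1)} = \frac{\hat{Q}_{\overline{uls},A}}{1-\hat{Q}_{\overline{uls},\overline{uls}}}\big(1-(\hat{Q}_{\overline{uls},\overline{uls}})^{k}\big),$$

$$\hat{Q}_{ls,A(2)} = (\hat{Q}_{\overline{uls},\overline{uls}})^{k}.$$

Proof.

a. $\hat{Q}_{ls,A(1)} = \Big\{\sum_{n=0}^{k-1}(\hat{Q}_{\overline{uls},\overline{uls}})^{n}\Big\}\hat{Q}_{\overline{uls},A}$.

b. the second part is obvious.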

Switching emphasis from the uls phase to the ls one,
we can alternatively treat uls as a ls phase with k random

It may be of interest to keep track of the number of
defects found while screening; with this in mind, we have

Theorem 8. Splitting sc into H0 and sc', we obtain
6H0,sc'  * Q8C',H0  “ * (1-  (q/*)1"1) 

Q8C',a  " • 


Proof. Treat sc' as sc and use induction; rest is
trivial.

Because of the complexity involved in evaluating $Q_{sc,A}$,
one might be led to considering the following variation, sc'':

[Diagram: sc'', a chain of states H0, H1, ... with forward transitions of probability q.]

Then we easily have:

$$\hat{Q}_{sc'',A}(z) = \Big(\frac{q}{z-p}\Big)^{I}.$$

Thus

$$Q_{sc'',A}(n) = \binom{n-1}{I-1}(q^{I})(p^{n-I}),$$

a result which speaks for itself. Furthermore,
$\hat{Q}_{sc'',A}(1) = 1$, which implies that sc'' included in any sampling
plan, in place of sc, would still yield an irreducible MC
(or SMC). By considering the polynomial $c(q) = 1 - Iq^{I-1} + (I-1)q^{I}$,
we can show that, for 0 ≤ q ≤ 1, c(q) ≥ 0, so that
$\mu_{sc''} \le \mu_{sc}$.
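The comparison between the variation sc'' and the ordinary screening phase can be made concrete. The sketch below assumes the sc'' exit time is negative binomial (transform (q/(z-p))^I, mean I/q) and that the ordinary screening mean is (1-q^I)/(p q^I); under those assumptions the gap between the two means equals c(q)/(p q^I) with c(q) = 1 - I q^(I-1) + (I-1) q^I, so the sign of c(q) decides the inequality. All function names are hypothetical.

```python
from math import comb


def mu_sc(p, I):
    """Assumed mean sojourn of ordinary screening: (1 - q^I) / (p q^I)."""
    q = 1.0 - p
    return (1.0 - q ** I) / (p * q ** I)


def mu_sc2(p, I):
    """Mean of the negative binomial exit time of sc'': I / q."""
    return I / (1.0 - p)


def c_poly(q, I):
    """c(q) = 1 - I q^(I-1) + (I-1) q^I."""
    return 1.0 - I * q ** (I - 1) + (I - 1) * q ** I


def q_sc2_pmf(p, I, n):
    """P[exit from sc'' at step n] = C(n-1, I-1) q^I p^(n-I) for n >= I."""
    if n < I:
        return 0.0
    q = 1.0 - p
    return comb(n - 1, I - 1) * q ** I * p ** (n - I)


if __name__ == "__main__":
    p, I = 0.1, 4
    print(mu_sc2(p, I), mu_sc(p, I))
    print(mu_sc(p, I) - mu_sc2(p, I), c_poly(1.0 - p, I) / (p * (1.0 - p) ** I))
```

Both printed differences should coincide, and the first line should show the sc'' mean no larger than the sc mean.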

Throughout the above analysis, $Q_{\overline{sc},\overline{sc}}$ and $Q_{\overline{uls},\overline{uls}}$
have been regarded purely function-theoretically; in Chapter
2, $Q_{i,j}$, i ≠ j, has a natural probabilistic definition given
within the context of a SMC. It is reasonable, therefore,
to search for a corresponding interpretation for $Q_{j,j}$ within
some similar stochastic framework. Such a framework exists:
it is a MRP. This MRP is just $(Y_n, W_n)$, where $W_n$ records
the n-th renewal. Hence if self-transitions are allowed,

$$Q_{j,j}(t) = P[W_1(j) - W_0(j) = t \mid W_0(j) = 0].$$


As noted in Chapter 2, if $Q_{i,i}$ = 0 for all i, a MRP is
equivalent to its corresponding SMC; that is, while its
introduction leads to some important renewal theoretical
ideas which are useful in studying a SMC (see Chps. 4 & 5),
it is otherwise superfluous. Moreover, if there exist
only defective self-jump p.d.f.'s, the MRP can still be
looked upon as only a prior necessary formality since a
unique SMC can be derived from it with the following Q*'s
[Ref 7.12]: If $A_{i,i}(+\infty) < 1$, for all i, we have

$$(SMC^{*})\qquad \hat{Q}^{*}_{i,j} = \begin{cases}\dfrac{\hat{Q}_{i,j}}{1-\hat{Q}_{i,i}}, & \text{if } Q_{i,i}\ne 0,\\[1ex] \hat{Q}_{i,j}, & \text{if } Q_{i,i} = 0.\end{cases}$$


In contrast to the two previous cases, if there is a j such
that $A_{j,j}(+\infty) = 1$, there is then no longer a uniquely
derivable SMC; for example, a one dimensional renewal
process leads to a trivial SMC, but a trivial SMC leads
to any 1 dimensional renewal process. In other words, a
SMC doesn't record self-jumps; a MRP does. This is the
essential difference between the two concepts. Proposition
4.1 further shows that, as far as the constructive technique
considered in this paper is concerned, the distinction
between the two processes is irrelevant.


4.0 SAMPLING PLANS AND FI(N). CSP-1 and (generalized)
CSP-2, the first two moments of the FI(N) functional
defined on them for the Job Shop entry case, and the
connections between the intercalation of self transitions
into these plans and Markov Renewal Theory are
investigated in detail. Other standard but more complex plans
are then briefly treated. However, to more fully
appreciate the practical results of this chapter as well
as to gain further insight into the basic method, some
general statements are first demonstrated.

4.1 Preservation of properties. Since we are at last
ready to deal with MC's in their entirety, rather than
just selected pieces of them, the preservation of certain
properties in the construction of a SMC from a given MC
becomes a primary concern. We first prove

Proposition 1. Given a MC from which two SMC's are
constructed: SMC' is constructed using self-transitions
for one MC-state (or phase); SMC, on the other hand, is
constructed in the same manner as SMC' except without
self-transitions. Then the two SMC's are equivalent in the
sense that their sample paths are the same (and the
transitional probabilities are equal).


Proof.

a. Let j be a non degenerate MC state with possibly
multiple entries and exits. Given any exit MC state k for
j (k ≠ j), we then have

$$\hat{Q}_{j,k} = \Big(\sum_{m=0}^{\infty}(\hat{Q}'_{j,j})^{m}\Big)\hat{Q}'_{j,k}$$

or

$$\hat{Q}_{j,k} = \frac{\hat{Q}'_{j,k}}{1-\hat{Q}'_{j,j}},$$

where $\hat{Q}'_{j,j}(1) < 1$ since the MC is irreducible.


b.  Pi  j ■ z Q'  .p/  . + or  .pc  . + j • 
3 » 3 j 3 h,  j j,j  j,j  j 


or  pj,j(1“Qj,j)  " r5j,h"h,j  + V1^ 


q; 


6; 


y • i ^ W • 

or  . - z — P.*  . + Hn(l-z  — Ijl® ) 

3,:l  h,;l  0 s u-6^] 
Similarly,

P'  -IQ  .P.  _ for  m and/or  n not  equal  to  j. 
m,n  in / ri  n/ii 


c. The primed quantities bear the same relationships
among themselves, via the Q's, as the unprimed.

Theorem 1. The four principal properties of
time-homogeneity, finiteness of states, irreducibility, and
aperiodicity are preserved.

Proof.

a. All four properties are trivially true for the
given MC.

b. By taking into account the SMC method and the concept
of filtration [Ref 7.6, Chp 8], it is obvious that, once the
term "canonical" is dropped, one can construct a multitude
of SMC's from the MC which in turn can be considered to be
the basic SMC; in other words, from the MC, a primitive SMC
can be constructed by treating each MC state as a degenerate
SMC state regardless of phase segmentation; in particular,
if a MC state has self-transitions, then self-jumps must be
introduced for this state; by Proposition 1 and a., the
resulting SMC (MRP) is equivalent to the original MC.

c. Any other type of SMC constructed from this MC
is a filtration of the primitive one.

d. According to [Ref 7.6, Chp 8], filtrations
preserve all four of the properties.

Corollary.  Let  i be  a state  of  an  (irreducible)  SMC 
constructed  from  a given  (irreducible)  MC;  let  be  the 

MC  states  contained  in  i but  also  considered  as  degenerate 
SMC  states  in  the  original  MC.  Then 


Proof,  a.  Proposition  1 implies  that  ag  is  self- 
transition independent,  b.  The  equality  follows  from  a. 
and  Theorem  1 on  the  filtration  of  a SMC. 



Before proceeding to the next section, we prove a
statement on the rapidity of convergence of a solution to
a certain type of renewal equation.

Proposition 2. Given three sequences $\{a_n\}$, $\{b_n\}$, and
$\{c_n\}$ such that (i) $c_n = (a*c)(n) + b_n$, (ii) $a_n, b_n \ge 0$ for
all n, and (iii) g.c.d. $\{k \mid a_k > 0\} = 1$. Then, if

$$\sum n^{k}b_n < +\infty \quad\text{and}\quad \sum n^{k+1}a_n < +\infty,$$

we have

$$c_n = \frac{(H_0*b)(\infty)}{(H_0*\tilde{a})(\infty)} + o(n^{-k}),$$

where $\tilde{a}(n) = na(n)$.

Proof. [Ref 7.10, esp. Thm. 4].
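A small numerical experiment illustrates the kind of convergence a renewal equation exhibits. The sketch below solves c_n = (a*c)(n) + b_n by direct recursion and compares c_n for large n with the constant (sum of b_n) / (sum of n a_n), which is the standard renewal-theorem limit and is assumed here to be the leading term intended above. It further assumes a_0 = 0 and that a sums to 1; the sequences are made-up test data.

```python
def solve_renewal(a, b, n_max):
    """Solve c_n = sum_{j=1..n} a[j] * c[n-j] + b[n] for n = 0..n_max.

    a and b are finite lists; a[0] is assumed to be 0 so the recursion
    is explicit. Entries beyond the end of a list are treated as 0.
    """
    c = []
    for n in range(n_max + 1):
        top = min(n, len(a) - 1)
        conv = sum(a[j] * c[n - j] for j in range(1, top + 1))
        bn = b[n] if n < len(b) else 0.0
        c.append(conv + bn)
    return c


def renewal_limit(a, b):
    """(sum of b_n) / (sum of n * a_n), the assumed limiting constant."""
    return sum(b) / sum(n * an for n, an in enumerate(a))


if __name__ == "__main__":
    a = [0.0, 0.5, 0.3, 0.2]     # aperiodic: g.c.d. of the support is 1
    b = [1.0, 0.5, 0.25]
    c = solve_renewal(a, b, 200)
    print(c[10], c[50], c[200], renewal_limit(a, b))
```

With finitely supported a and b, every moment condition holds, so the error decays faster than any power of n.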

4.2  Sampling  plans. 

4.2.1 CSP-1. Upon setting sc = 1 and uls = 2, CSP-1 has
the following SMC transitional diagram:

[Diagram: two-state SMC with transitions 1 → 2 and 2 → 1.]

with states (1; $\hat{Q}_{12}(z)$) and (2; $\hat{Q}_{21}(z)$); the SMC diagram
above should be carefully distinguished from the box one
in Figure 5, Chapter 2 which is a MC transitional diagram.
Since $\hat{Q}_{12}(1) = 1 = \hat{Q}_{21}(1)$, the corresponding embedded MC
has the transitional matrix

        1  2
  1     0  1
  2     1  0

which is periodic with period two; however, the SMC itself
is aperiodic by Theorem 3.3 since supp($Q_{12}$) = {I+k | k = 0 to ∞}.

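From Theorem 2.2, we can easily write down the Z-transformed
(F.S.) for CSP-1:

$$\hat{P}_{11} = \frac{\hat{J}_1}{\hat{S}(z)}, \qquad \hat{P}_{12} = \frac{\hat{Q}_{12}\hat{J}_2}{\hat{S}(z)},$$

$$\hat{P}_{21} = \frac{\hat{Q}_{21}\hat{J}_1}{\hat{S}(z)}, \qquad \hat{P}_{22} = \frac{\hat{J}_2}{\hat{S}(z)},$$

where

$$\hat{S}(z) = 1 - \hat{Q}_{12}\hat{Q}_{21}.$$

Because of the simplicity of CSP-1, the above system can be
written down directly from combinatorial principles.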

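The eigen-vector equation, eT = e, for CSP-1 yields
e = (1/2, 1/2) as a solution; therefore, using $\mu_{sc}$ and $\mu_{uls}$
from Theorem 3.5 we have, by Theorem 2.4:

$$P_{sc}(\infty) = \frac{\mu_{sc}}{\mu_{sc}+\mu_{uls}} = \frac{f(1-q^{I})}{f(1-q^{I})+q^{I}},$$

$$P_{uls}(\infty) = \frac{\mu_{uls}}{\mu_{sc}+\mu_{uls}} = \frac{q^{I}}{f(1-q^{I})+q^{I}},$$

results which can also be obtained directly from the
Z-transformed (F.S.) through additional use of l'Hospital's
rule. From Proposition 2.3, we also have:

$$\hat{F}_{12} = \hat{Q}_{12}, \quad \hat{F}_{21} = \hat{Q}_{21}, \quad \hat{F}_{11} = \hat{Q}_{12}\hat{Q}_{21}, \quad\text{and}\quad \hat{F}_{22} = \hat{Q}_{21}\hat{Q}_{12}\ (= \hat{F}_{11}).$$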
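The long-run fraction of time spent screening under CSP-1 can be cross-checked against an explicit Markov chain. The sketch below builds a chain on H0..H(I-1), SN, SI and finds its stationary vector by power iteration, then compares the total mass on the H states with f(1-q^I) / (f(1-q^I) + q^I). The chain layout is an assumption made for this illustration: a defect always returns to H0, a good item in H(I-1) enters uls as SN (probability u) or SI (probability f), SN moves to SN or SI with probabilities u and f, and a good inspected item in SI does the same.

```python
def csp1_stationary(p, f, I, iters=5000):
    """Stationary distribution of the assumed CSP-1 chain by power iteration."""
    q, u = 1.0 - p, 1.0 - f
    n = I + 2                    # states H0..H(I-1), then SN, SI
    SN, SI = I, I + 1
    T = [[0.0] * n for _ in range(n)]
    for j in range(I):
        T[j][0] += p             # defect while screening: back to H0
        if j < I - 1:
            T[j][j + 1] += q     # good item: advance
        else:
            T[j][SN] += q * u    # I-th good item: enter uls
            T[j][SI] += q * f
    T[SN][SN], T[SN][SI] = u, f
    T[SI][0] = p                 # inspected item defective: back to screening
    T[SI][SN], T[SI][SI] = q * u, q * f
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * T[i][k] for i in range(n)) for k in range(n)]
    return pi


def csp1_sc_fraction(p, f, I):
    """Closed form f(1 - q^I) / (f(1 - q^I) + q^I)."""
    qI = (1.0 - p) ** I
    return f * (1.0 - qI) / (f * (1.0 - qI) + qI)


if __name__ == "__main__":
    p, f, I = 0.1, 0.2, 5
    pi = csp1_stationary(p, f, I)
    print(sum(pi[:I]), csp1_sc_fraction(p, f, I))
```

Power iteration is used only to keep the example dependency-free; any linear solver for the stationary equations would serve.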

An  application  of  Proposition  2 is  found  in 
Theorem  1. 

P,*(n)  - VQ21*J2<n>  + o(n“k), 

1 

for  arbitrary  k. 
Proof.


a*  P12  “ Q12*P22 


P22  " °21*P12+J2 


P12  “ <Ql2*Q21)#P12+Q21*J2 


P11*P12+Q21*J2 


...It 


b.  **»  (Q2i*J2>  (n)  < + - 


lnIc+1P11  (n)  < + - 


which  follows  from  Theorems  3.1  and  3.2  along  with  repeated 
differentiation  of  the  transforms  to  obtain  the  higher 
moments . 


c. We are done by a., b., and Proposition 2.
Corollary. (l'Hospital's rule from a renewal eq.)

££  pi2(n)  • =ar^T»T  I1"1- 


n—  ” — V 11 

where  b(n)  ■ Q2i*J2^* 
Proof. 


a. Let $c_n = (a*c)_n + b_n$ with conditions of Proposition 2
holding for k = 0; then, if $H_0*a(\infty) = 1$, we have

$$\hat{c}(z) = \frac{\hat{b}(z)}{1-\hat{a}(z)}.$$

b. Therefore, from a.,

$$\lim_{z\downarrow 1}\frac{z-1}{z}\,\hat{c}(z) = \lim_{z\downarrow 1}\frac{z-1}{z}\cdot\frac{\hat{b}(z)}{1-\hat{a}(z)} = \frac{\hat{b}(z)}{-z\hat{a}'(z)}\bigg|_{z=1}.$$

c. But c(∞) = the limit on the L.H.S. in b. Therefore
we are done by letting a = $F_{11}$.

Of importance in analyzing the FI(N) function is the
monotonicity of $P_{12}(\cdot)$. We prove

Theorem 2. (Monotonicity) $P_{12}(n)$
or $P_{22}(n)$ is monotonically nondecreasing or nonincreasing
respectively.


Proof. 


a.  AP11(n)  - Pj^  (n+lJ-Pj^  (n) 

- -Q12<n)  + - P^j^Q^Cn)) 

b.  " pu)  <n> 

- PlWjU)  + Tx  2 - n)  - PlWjU)  - n] 

- PtW^l)  + T12  - n and  T12  t 0] 

I 0» 

c. Thus $\Delta P_{11}(n) \le 0$ from a. and b.

d. $P_{12}(n) = H_0(n) - P_{11}(n)$ ∴ $\Delta P_{12}(n) \ge 0$.

In the same way we can show that $\Delta P_{22}(n) \le 0$ which
finishes the proof.

Before  moving  on  to  CSP-2 , we  prove  a statement 
concerning  the  roots  of  the  fundamental  polynomial 
(F.P.)  of  CSP-1. 

Proposition 3. The denominator of the rational function

$$\frac{1}{1-\hat{Q}_{12}(z)\hat{Q}_{21}(z)}$$

is FP(z) =

$$(z-q)(z^{I} - pz^{I-1} - pqz^{I-2} - \cdots - pq^{I-1}).$$


Proof.  FP(z)  « (?(z)  - «q'L(z-q) 
from  Theorems  3.1  and  3.2. 

In  [Ref  7.1],  FP(z)  is  obtained  directly  as 

zI(z-0)+(#,  Q = fpq1  P = 1-4;  thus  the  SMC  approach  gives 
some  insight  into  the  root  distribution  of  FP(z). 

4.2.2 CSP-2. Upon letting 1 = sc, 2 = uls, and 3 = ls,
CSP-2 has the following SMC transitional diagram:

[Diagram: three-state SMC with transitions 1 → 2, 2 → 3, 3 → 1, and 3 → 2.]

with states (1; $\hat{Q}_{12}(z)$), (2; $\hat{Q}_{23}(z)$), and (3; $\hat{Q}_{31}(z)$, $\hat{Q}_{32}(z)$);
once again the SMC diagram should be carefully distinguished
from the one in Figure 5, Chapter 2. Since, in addition to
$\hat{Q}_{12}(1) = 1 = \hat{Q}_{23}(1)$, we also have $\hat{Q}_{31}(1) = 1-q^{k}$ and $\hat{Q}_{32}(1) = q^{k}$,
the embedded MC has a transitional matrix

           1        2      3
      1    0        1      0
  T = 2    0        0      1
      3    1-q^k    q^k    0

which is however aperiodic; again the SMC is aperiodic for
exactly the same reasons it is for CSP-1.

Because  of  the  increased  complexity  of  CSP-2,  we 
proceed  to  derive  the  basic  transitional  probabilities 
more  formally  than  with  CSP-1. 

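Proposition 4. Transitional probabilities of CSP-2
(first row).

$$\hat{P}_{11} = \frac{\hat{H}_0(1-\hat{Q}_{12})(1-\hat{Q}_{23}\hat{Q}_{32})}{\hat{G}},$$

$$\hat{P}_{12} = \frac{\hat{Q}_{12}\hat{H}_0(1-\hat{Q}_{23})}{\hat{G}},$$

$$\hat{P}_{13} = \frac{\hat{Q}_{12}\hat{Q}_{23}\hat{H}_0(1-\hat{Q}_{31}-\hat{Q}_{32})}{\hat{G}},$$

where $\hat{G} = 1 - \hat{Q}_{23}(\hat{Q}_{32}+\hat{Q}_{31}\hat{Q}_{12})$.

Proof. (F.S.) for CSP-2, Z-transform, and Cramer's
rule for linear algebraic systems which holds since there
exists an R such that |z| > R ⟹ $\hat{G}(z) \ne 0$.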


Upon solving the eigen vector equation for the embedded
MC in the CSP-2 case, we obtain

$$e = \Big(\frac{1-q^{k}}{c},\ \frac{1}{c},\ \frac{1}{c}\Big) = (e_1, e_2, e_3)$$

where $c = 3-q^{k}$. Combining this result with Theorems
3.1, 3.2, 3.3, and 2.4, we have

$$P_{sc}(\infty) = \frac{f(1-q^{k})(1-q^{I})}{D}, \qquad P_{uls}(\infty) = \frac{q^{I}}{D}, \qquad P_{ls}(\infty) = \frac{(1-q^{k})q^{I}}{D},$$

where $D = (f)(1-q^{k})(1-q^{I}) + (2-q^{k})(q^{I})$.
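The CSP-2 limiting probabilities can be reproduced from the embedded-chain eigenvector and the mean sojourn times. The sketch below assumes the usual semi-Markov limit formula P_i = e_i mu_i / sum_j e_j mu_j (taken here to be what Theorem 2.4 supplies), the embedded matrix with third row (1-q^k, q^k, 0), and means (1-q^I)/(p q^I), 1/(fp), (1-q^k)/(fp) for sc, uls, ls. Names are hypothetical.

```python
def smc_limiting_probabilities(e, mu):
    """P_i = e_i * mu_i / sum_j e_j * mu_j (assumed semi-Markov limit formula)."""
    total = sum(ei * mi for ei, mi in zip(e, mu))
    return [ei * mi / total for ei, mi in zip(e, mu)]


def csp2_embedded(p, k):
    """Embedded MC matrix and its stationary vector for states (sc, uls, ls)."""
    qk = (1.0 - p) ** k
    T = [[0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [1.0 - qk, qk, 0.0]]
    c = 3.0 - qk
    e = [(1.0 - qk) / c, 1.0 / c, 1.0 / c]
    return T, e


def csp2_limits(p, f, I, k):
    """Limiting probabilities from e and the assumed mean sojourn times."""
    q = 1.0 - p
    qI, qk = q ** I, q ** k
    _, e = csp2_embedded(p, k)
    mu = [(1.0 - qI) / (p * qI), 1.0 / (f * p), (1.0 - qk) / (f * p)]
    return smc_limiting_probabilities(e, mu)


def csp2_closed_form(p, f, I, k):
    """Closed forms with D = f(1-q^k)(1-q^I) + (2-q^k) q^I."""
    q = 1.0 - p
    qI, qk = q ** I, q ** k
    D = f * (1.0 - qk) * (1.0 - qI) + (2.0 - qk) * qI
    return [f * (1.0 - qk) * (1.0 - qI) / D, qI / D, (1.0 - qk) * qI / D]


if __name__ == "__main__":
    print(csp2_limits(0.1, 0.2, 5, 3))
    print(csp2_closed_form(0.1, 0.2, 5, 3))
```

The two printed triples should coincide and each should sum to 1.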

For  future  use  in  studying  the  FI(N)  functional  for 
CSP-2,  we  now  give  an  example  of  the  uses  of  filtration 
to  combine  SMC  states  2 and  3 into  one  (super)  state 
with  and  without  self  jumps. 
Case 1: self-jumps


In this case, we are allowing the events 2→2, 2→1, and
1→2 to filter through; thus the filtered set is just
{3}. The corresponding states of the filtered SMC, whose
transitional  diagram  appears  above,  are  (T)  j)  and 

(Jl  ^23^31'  ^23^32* 
where 

" Q23Q31  * 

^1,7  “ ^23^32  ' 

anti  „ 

“ Qi2  • 

The  transitional  matrix  of  the  filtered  SMC's  embedded  MC  is 


In this case, we are only allowing the events 2→1 and 1→2
to filter through; thus the filtered set is again {3}.
However, the corresponding states of this filtered SMC
are now


(h  Or?)  and 

^3032 


since 


°5.I 


31* 


Again  for  future  application  to  the  analysis  of  FI (N) , 
we  use  Case  1 to  prove 

Theorem  3.  (Expansion  theorem) 
eT,I<n>  - f (l)pl,2*<«0  +1*223>*S>(,» 

where 

S<n)  - jt0tl)1Ui((1)P12))V223>‘3> 

Wo  1 

which  converges  for  ------  '(^^)  <1. 

"(1)"  is  CSP-1,  1 » sc,  2 - uls,  and 

1R223^  * Plx{n)»3|  x(n-l)«2,  x(0)*2  and  x(k)^lf  0<k<n-l] . 


Proof . For  convenience,  let  6^  ■ xafa  and  ■ xa  b 
if  the  latter  is  different  from  the  former. 

a.  Using  this  notation,  we  have,  from  the  (F.S.)  for 
the  filtered  SMC, 


"r.r 


H0x12  ^x21+x22^  ^ 
1- (xi2x21+x22^ 


b.  1 *22  " x12x21 


1 " x22  " X12X23X31 


1 “ x22  " x12x23 ^X23“X22^ 


(1-X22)  - Xi2x23[<l-x22)  + {l-x23)] 


a-‘x22)  <1-jc12x23)  + (1"x23)x12x23 


■ (l-x22)  ^“X12X23^ 

a X12^"X23^  X23 

where  a - ■,(* (— — > . 

U x^2x23  1-x22 


c*  X12  ^x21+x22^  ^ m X12  ^“x23^  ^^"x3i^ 


since  x31  - x23  - x22 

d.  a.,  b.,  and  c.  therefore  give 

”0*12  (1“x23)  , 1 . 
T7  (l-x12x23)  (l-x22)  1+a 


#12  (1  + t~->  (777) 


(1)  12 


^**x22  ^+a 


which  is  just  the  Z-transform  of  the  assertion;  we  finish 
this  part  by  noting  that  Q23  - Q21  for  CSP-1. 

e.  This  factored  expression  approaches 


u2  1 

(r+r)  (1  + — V ( 

W1  W2  l-q* 


^2  1 
1+(r+r-)  (“JTc) 

yl+W2  1-q* 


however  we  also  have,  for  the  third  factor. 


5tl  *<»>  - m Ho1(ftOi(l)) 


it? 

b(«),  endpoint  property, 
H0*a<-) 


m2  1 
W1W2  l-qK 


a (n) , for  all  nj 

thus  the  conv.  criterion  suffices. 

4.3 FI(N) functional. Before embarking on a detailed
examination of the moments of FI(N), the following
fundamental theorem is proven to illuminate the analysis of
variance.


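Theorem 4. Let $\{a_n\}$ and $\{b_n\}$ be two positive sequences
such that (i) $a_n \uparrow$ or $\downarrow A$ and (ii) $b_n \uparrow$ or $\downarrow B$ as n → ∞. Then

$$\frac{1}{N^{2}}\sum_{k=0}^{N}(a*b)(k) \to \frac{AB}{2} \quad\text{as } N\to\infty.$$

Proof. We will only prove this for $a_n \uparrow A$ and $b_n \uparrow B$.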
a.  ; 

¥ 

ca*b  + (110*»>  ls> 

n2  ir 


(B) 


(H?*b) (N)  H*(N) 

— 5 =-5 (A)  - -Sij—  AB, 


N 


N 


where  H*  ■ hq* *HQ  k times. 


707 


b.  0 < *L*rl)±L*r*} 

N2 

< /Z<a~A)  4 ,s(b-BK 
— ' N M N ' 

which  approaches  zero  as  since  ordinary  convergence 
implies  conv. 

(H^a)  (M) 

0.  

ir 


(1+-r> 


l HQ*a(N)  Hq*  0>  (n) 

— ' r * » 


but 


V <a> (n) 


N 


N 


N 

Z na_ 
n*0 


n 


N 

*uai 


N 


M 


< AN  (N*f  1 ) + gN(a  -A) 

- 2N2  N2 

A i ® (an”A) 

■ -iHHH  + 


A 


N 


dp 


H0*  <l(>  <N> 


N 


N 

- E r 
0 

IT 


~-N-y-'  -*-1/2  as 


2N 
therefore


H0(N)  . 1 , H0(N>  »0‘<H0>  <»> 

^ 1? ? 


1 - 1/2  * 1/2,  as  N+*. 


„ V 0<W 

*•  (1+  N ' H " "3 

N 


A i-A 


i— A - -i- 


i 4 Vb(N>  VO  <») 

(1+ir)  — s " — 


B ~ —i— B 


N 


4-b 


from  c.  and  the  hypotheses,  as  N+<». 

f. Thus the whole expression approaches

$$0 + \tfrac{1}{2}AB + \tfrac{1}{2}AB - \tfrac{1}{2}AB = \tfrac{1}{2}AB$$

by e., d., and a.
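The limit AB/2 reached at the end of this proof can be observed numerically. The sketch below computes (1/N^2) times the sum over k = 0..N of the convolution (a*b)(k) for two increasing sequences with limits A and B, using partial sums of b so that the work is linear in N. The particular sequences and the normalisation by N^2 are assumptions chosen to match the limiting value AB/2 derived above.

```python
def cesaro_convolution_ratio(a, b, N):
    """(1 / N**2) * sum_{k=0..N} (a*b)(k), with a and b given as functions of n.

    Uses sum_{k<=N} sum_{j<=k} a(j) b(k-j) = sum_{j<=N} a(j) * B(N-j),
    where B(m) is the partial sum b(0) + ... + b(m).
    """
    partial_b = []
    running = 0.0
    for i in range(N + 1):
        running += b(i)
        partial_b.append(running)
    total = sum(a(j) * partial_b[N - j] for j in range(N + 1))
    return total / N ** 2


if __name__ == "__main__":
    A, B = 2.0, 3.0

    def a(n):
        return A * (1.0 - 0.5 ** (n + 1))   # increases to A

    def b(n):
        return B * (1.0 - 0.8 ** (n + 1))   # increases to B

    for N in (50, 500, 5000):
        print(N, cesaro_convolution_ratio(a, b, N), A * B / 2.0)
```

The approach to AB/2 is slow, of order 1/N, which is consistent with the averaging argument used in the proof.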

4.3.1 CSP-1, Job Shop entry. The form FI(N) takes for
this MC is

$$FI(N) = 1 - \frac{1}{N}\sum_{j=0}^{N}M_{SN}(j).$$

Thus here we want $E_{H0}[FI(N)]$ which is

$$1 - \frac{1}{N}\sum_{j=0}^{N}P^{j}_{H0,SN} = AFI(N).$$

However, since

u = P[SN | uls],

we have

$$uP_{sc,uls}(j) = P^{j}_{H0,SN}$$

or

$$uE_{H0}(X_2(j)) = E_{H0}[M_{SN}(j)].$$
Summing up, keeping the monotonic growth property of
$P_{sc,uls}(\cdot)$ in mind, we have


Theorem  5. 
AFI(N)  =1 ' 


N 
Z P. 


N • q sc.uls 


(HU 


A? I (N)  + ^auls  a* 


Since $\mathrm{Var}(1-W) = \mathrm{Var}(W)$ and $\mathrm{Var}(W) = E[W^2] - (E[W])^2$,
we will henceforth generally restrict the discussion to
second moments of W.  To deal with the variance of FI(N)
in the MC case requires that the following expression be
considered:

^HO  ^SN  ^ ( j+k)  ] * Pu 


SN 


HO ,SN  SN, SN  ' , , 

$0 \le j, k \le N$.  However, since the variance for CSP-1, J-S
entry, is treated from this viewpoint in [7.1], we will
use the 2-state SMC for CSP-1, relating the results to
those obtained for the MC model; for a treatment of
variance which uses a three-state SMC (i.e., sc, SN, and
SI), see Chapter 5.

Proposition 5.  Letting sc = 1 and uls = 2,

$$E_1[X_2(n)\,X_2(n+k)] = P_{12}(n)\,P_{22}(k).$$

Proof.

$$\begin{aligned}
P[X(0)=1,\ X(n)=2,\ X(n+k)=2]
&= P[M(0)=H0,\ M(n)=SN \vee SI,\ M(n+k)=SN \vee SI] \\
&= P[M(n)=SN \vee SI \mid M(0)=H0]\;P[M(n+k)=SN \vee SI \mid M(n)=SN \vee SI] \\
&= P_{12}(n)\,P_{22}(k).
\end{aligned}$$


The result can also be seen by treating 2 as
degenerate such that at each step either $2 \to 2$ with probability
$\beta$ or $2 \to 1$ with probability $\delta$.


no 


I s 


us  consider 
, *wX2(j) 

E1[( jj )2]  , E1{W2]. 


Using  the  convolution  and  proposition  5,  we  have 


E fW2!  « lu  unN 

1 n2  o 12  *22  lN' 


where  ^ - Ejtl-W].  Now,  we  further  here 


SNySl,SN  “ UPSN,SN 

„n 


- ...  ♦ £ PSI,SN 


psx,s»  - q<«>!£SN  * »*£**,)  * PP^1SN 


qPSNVSI,SH  + PPHo!sK 


Therefore, substituting (2) into (1), we have

n 


or 


?sNysi,SN  - “pSn,sn  + f<3P3tiysi,SN  + 


SN 


or 


2,2  aPSN,SN  + fquP2,2  '►  fpup.^  2 (n-l) 


P2,2{n)  - *JP2,2(n-l)  - fpP  (n-x,  * pn 

SN, 


SN 


Thus  from  (3)  we  have 


2u 


JJI  (H0*P12*P22{N)  " (fq)H0*p12*P22{N-l) 


' {fp)VP12*Pl2<N-l))  - -5 

N* 


(1) 


(2) 


(3) 
$= E_{H0}[(1-FI(N))^2]$.  Thus bounds can be developed for the
2-state SMC and then translated into bounds for the MC
or primitive SMC; Theorem 5 also shows the proper convergence
(hence the factor 2).

4.3.2  CSP-2,  J-S  entry.  For  this  plan,  the  MC  formula 

is 

N k-l  N 

FI(N)  - 1 b-  ( £ M-j-U)  + 1 

« s.q  B j-0  S"0  J 

Because  of  the  fact  that  we  can  consider  7 as  being 
randomly  entered  at  each  step,  given  that  it  is  entered, 
we  have 

Proposition 6.  Letting I = sc and 7 = uls for the
self-lump filtration of CSP-2, we have

k-r 

uEylXyU)]  - + ^HO^SN^1* 


Proof. 

uPr,i(B) 


k-1  n 


£ 

j— 1 


k-1  n 


where  SN(-l)  - SN.  Thus  we  have 
Theorem  6.  For  CSP-2 , 

N 

AFI(N)  - 1 K-  £ Py  ▼ • 

N s-0  1,4 

Py  y(s)  can  be  expanded  in  terms  of  (i)Pi2^  ^ Theorem  3 
and  therefore  (2)AFI(N)  can  in  turn  be  expanded  in  terms 
of  (1)AFI(N). 
To deal with the variance for CSP-2, we will split
the ls state (phase) into k new (sub)states in order to
make use of the convolution as in


Proposition 7.  Letting a = 1 = sc, (SN $\vee$ SI) = uls = $-1$, and
(SN$_j$ $\vee$ SI$_j$) = j, we have

$$P[X(0)=a,\ X(n)=j_1,\ X(n+s)=j_2] = P_{a j_1}(n)\,P_{j_1 j_2}(s),$$

where $j_1$ ($j_2$) range from $-1$ to $k-1$.

Proof.  As in Proposition 5.

Proceeding as in the CSP-1 case, we can express the variance
in terms of the states a and j, $-1 \le j \le k-1$.  Thus we will
restrict our attention to


k-1 
K S 
j— 1 


N 

i x. (a) 

s«o  3 

— fi 


Use  of  the  convolution  and  Proposition  7 

_1 

N"  ~ J N“  N* 


2H0*Pa1*Pii(11) 


Vfajl 


gives 

(N) 


Ve»idJb3'’  * -pWV 


..  Ho,p.j,pj.i ><*>  . yp..v*pi-.t(M) 


N 


r 


N' 


since $P_{jj'}(0) = P_{j'j}(0) = 0$ if $j \ne j'$.
Summing up, we have
Theorem 7.

EaUl-PI(H))2] 

* 2H0*Plj'*pj',J<M>  - Ho*p.,j(M>>- 
We note that as $N \to \infty$, the expression in Theorem 7 approaches

*2 


k-1 

■ ( I in.) 
j— 1 j 


2 


by  Theorem  4. 


Readjusting notation again, sc = a, uls = b, otherwise
the same, we can rewrite the non-negative terms of
Theorem 7 as follows (factoring out $2/N^2$):


vp«b‘<aybj  ♦ phb» 


Letting  UB  ■ EP8j  + P^,  including  s»b,  we  have  the 
following  equations! 


pab<ub)'  pa(k-l) ^k-l*  with  stat88  a»  b»  o»  . ..» 

(k-1)  and  transformed  p.d.f.'s  yQ  ■ Qfa  Q - 4/(z-B); 

ya  " Qj,a  " «/(*-«)  0<j<k-lf  y » Oj  j+1  - fq/(z-u) 
tor  0<j<k-l)  - QJ[_l  b. 

Using the Z-transform, we obtain the following relationships
among the U's above.

B 

«h  “ vA  + H (l-y  5 


uo  - yux  + H0(l-y-ya)  + yaA 


Vi  “ A + flo<1-yya)  + ya* 

* k-i  „ 

where  A « l P . + P . . We  also  have 
j«0  a3  80 

ftoa-y)  - r/U-6)  - jb=^||Jb||N  ■■  BN 
h0 d-y-y,)  - «/<*->»  - jp=^f|j,||N  - „■ 


where  ||*||N  “ He*a  (H> . 

Using this system, we can progressively get bounds on
the $U_j$'s and therefore eventually on the variance for
FI(N).  For instance, letting k = 1, we would have:


Hubl 

1 N 

< 

(1-UN)  | 

Ml 

|N  + ‘N 

INI 

In 

< 

q(l-uN) 

II  “i 

||  „ + va-A 

IMIn  + “N 

INI 

l» 

< 

q(l-oN) 

II0** 

||  N + 

INI*  + *" 

Working  out  a full-blown  expression  for  a bound  on  the 
variance  is  extremely  tedious,  but  now  possible. 


4.4  Other sampling plans.  Setting 1 = sc, 2 = uls, 3 = ls,
and 4 = ck, the SMC transitional diagram of CSP-3 is:


with  states  (1;  Q12),  ®24J'  ^3f  ' and 

(4i  Q41,  Q43)  • 


Sample  solution  is 


^0^12  ^“®24^ 

"’I-STafl 


where 

G(Z)  “ G24°43°32  + ®12^24®41  + ®12^24^43°31# 
The eigen-vector solution for its embedded MC is
e ■ (e1#  e2,  e3,  e4) 


where  a • 3 + qm(l-qk). 

Other plans, like the multilevel ones, feature
phases of the type already met but with different
sampling parameters for the same types: for example,
ls(k, $f_1$), ls(k, $f_2$), etc.
5.0  ARBITRARY ENTRY CASE.  In elementary (continuous)
renewal theory, the introduction of a delayed p.d.f. is
necessary for dealing with the equilibrium case.  If
F(x) is the distribution function of the ordinary process,
$\mu$ the (long-run) mean time between renewals, $f_e(x)$ the
delayed p.d.f., and $m_e(x)$ the mean number of renewals
in time x for the equilibrium process, it can be shown
[Ref 7.8] that

$$f_e(x) = \frac{1 - F(x)}{\mu}$$

and

$$m_e(x) = \frac{x}{\mu}.$$


In Cinlar's paper [Ref 7.6], analogous results are also
shown to hold.
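A discrete-time analogue of the equilibrium result can be verified directly. The sketch below is not from the paper: it assumes an interarrival p.m.f. on {1, 2, 3} with illustrative values, builds the delayed first-renewal p.m.f. $f_e(k) = P(X \ge k)/\mu$, and checks that the probability of a renewal at every time n equals $1/\mu$, so that the mean number of renewals in n steps is $n/\mu$. All function names are hypothetical.

```python
def ordinary_renewal_sequence(f, n_max):
    """u[n] = P(renewal at time n | renewal at time 0); f[k] = P(X = k), f[0] = 0."""
    u = [1.0] + [0.0] * n_max
    for n in range(1, n_max + 1):
        u[n] = sum(f[k] * u[n - k] for k in range(1, min(n, len(f) - 1) + 1))
    return u

def equilibrium_first_renewal_pmf(f):
    """Return (fe, mu) with fe[k] = P(X >= k) / mu for k >= 1."""
    mu = sum(k * fk for k, fk in enumerate(f))
    tail = 1.0                      # P(X >= 1)
    fe = [0.0]
    for k in range(1, len(f)):
        fe.append(tail / mu)
        tail -= f[k]                # now P(X >= k + 1)
    return fe, mu

def delayed_renewal_probabilities(f, n_max):
    """Probability of a renewal at times 1..n_max for the equilibrium process."""
    fe, mu = equilibrium_first_renewal_pmf(f)
    u = ordinary_renewal_sequence(f, n_max)
    probs = []
    for n in range(1, n_max + 1):
        # First renewal at time k (delayed p.m.f.), then an ordinary renewal n-k later.
        probs.append(sum(fe[k] * u[n - k] for k in range(1, min(n, len(fe) - 1) + 1)))
    return probs, mu

f = [0.0, 0.2, 0.5, 0.3]            # illustrative interarrival p.m.f. (assumed)
probs, mu = delayed_renewal_probabilities(f, 50)
print(mu, probs[:5])
```

Summing the first n entries of `probs` gives $n/\mu$, the discrete counterpart of $m_e(x) = x/\mu$.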

Following  Cinlar  [Ref  7.6,  Chp  9],  with  modifications 
for  the  discrete  time  case,  we  have  on  the  one  hand 


Definition  1.  Letting  25 . . (t)  be  the  delayed  p.d.f 

we  have 


• » 


On  the  other  hand,  for  the  Arbitrary  Entry  case  of 
a MC  model  of  a CSP,  using  either  the  MC  or  SMC  methods, 
the  initial  probability  vector  for  a given  phase  i is 
now  given,  overtly  dependent  on  the  structure  of  the 
entire  MC,  by 


where $\alpha_i$ is the long-run probability for SMC state i and
$\alpha_s$ is the analogous long-run probability for SMC state s
which arises from that filtration of the primitive SMC
which forbids SMC-state self-transitions.  Thus, we can
also define a delayed p.d.f. as in
Definition 2.  The delayed p.d.f. is defined as


1 r- 

Mi  fll 


t 

fs,A(j)' 


t 

where, once again, $f_{s,A(j)}(t)$ is the first entrance probability
(at time t) into phase j starting initially from state s in
phase i with probability $\alpha_s/\alpha_i$.


Two points concerning Definition 2 should be made.  In
contrast to the J-S case, the delayed p.d.f. is "used just once"
since any jump to j returns the process to the J-S case.  Secondly,
even though the delayed p.d.f. appears to be plan dependent, it is
shown later in this chapter that it is not; intuitively this is
reasonable since $\alpha_s/\alpha_i$ can be interpreted as the relative
time spent in phase i starting from s.

Below a. the delayed p.d.f.'s are constructed for all canonical
phases considered in the J-S case; b. definitions one and
two are proven equivalent by elucidating the relationships
between the primitive SMC and any filtration of it (thereby
showing the delayed p.d.f. to be plan independent); c. any SMC
(or MRP) is shown to be stationary if its initial p.d.f.'s are
given by Definition one; and d. the steady state SMC is derived
for CSP-1 and a bound on the variance of FI(N) is obtained.

5.1  Definition  equivalence.  Extensions  of  the  techniques 
used  here  to  include  variations  from  the  four  standard 
phases  are  straight-forward.  In  Theorems  one  through  four 

below,  im  . shall  have  the  meaning  assigned  to  it  by 

■f  A 

Definition  2. 

Theorem  1.  Definition  equivalence  for  sc. 

Proof.  a.  Again letting $H_k = k$, we have from the
basic MC system for sc,

where


qI_k  ,zk-p. 


k-1 k— 2 

-pqz 


■-pq  ) 


q1"* 


0k(M) 

“m 


$$\phi_k(z) = z^k(z-1) + \gamma_k, \qquad \gamma_k = p\,q^k,$$

and

$$\phi(z) = \phi_I(z).$$


b.  We now proceed to split sc into three consecutive
subphases sc(k), H(k), and R(k).  Thus, letting 1 = sc(k),
2 = H(k), and 3 = R(k) for simplicity, we have:


where a is the next state of the plan.  This splitting yields
the following transitional matrix for the embedded MC
obtained from this variant SMC:


1 

2 

3 

a 


2 

1 

0 

0 


3 

0 

q 

o 


■ T' 


- 0 0 J 

where r = I - (k+1).  Letting $e' = (e'_1, e'_2, e'_3, e'_a, \ldots)$,
we obtain the eigen-vector equation

$$e'T' = e'.$$

This equation in turn leads to the following algebraic system:
POj  + (l-qr)e£  + v « e£ 


e; 


qox 


- e2 
“ e3 

qre£  + u ■ e' 


(E0) 


I ■ 
This matrix induces the following linear system via


sc 

e.c+u  * ®a 


•:-®.c  ^ ' 


K: 

where  v * e • col  , u - e • col"#  coir  is  obtained 
cola  by  setting  its  first  component  equal  to  zero, 
from  (Bq),  we  also  have 


u - e ' - qI-ke£  1 

v.,K  lj 


<E2) 


Therefore  from  (£.)  and  (E2) , we  have 


8C  1 (E 

<*.  - •;>  J(  : 


) 


(We  also  know  that  ej  - ej  for  j^(l#2r3)). 

f.  Therefore,  from  (Eg)  and(E^) , we  get 


escwsc- 


*D'  - D. 


Thus  finally  from  d.  and  the  above. 

e q*"1 
a.  - 8°q 


H(k) 


or 


aH(k)  “ (y~~)  (qX  I)osc  ; 

SC 

1-1 

As  a check,  we  have  £ ■ a..  . 

0 H(k)  sc 

g.  From  a.  and  f . . we  have 

®H(k)  J _ X V(,) 

~=rr  £k,A  r-  TTiy 


sc 


sc 


eT  * e i 


from 

However 
h.  Since


1-1  1-1  k 1-1  k t 

l 0.  (Z)  - z(  I ZK)  - ( E Z*)  + (l-qA) 
k=0  K 0 0 


- (jiy) (ZI(Z-1)  - qI(z-l>), 
we  have  with  g . , 


I“1  C(l_  A 

E — f 


k-0  *»o  "k'A  “bo  7Xt' 
H 


- £ ^ 


The proof is finished by noting that $Q_{sc,A}(1) = 1$ and
$q^I(z-q)/\phi_I(z) = Q_{sc,A}(z)$.
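As a sanity check on a transform of this shape, the sketch below compares the closed form $q^I(z-q)/(z^I(z-1) + p\,q^I)$ with a direct computation of the waiting time until I consecutive conforming units are observed, each unit being defective with probability p. Identifying this waiting time with the sc phase, and the use of negative powers of z in the transform, are assumptions made for illustration; the parameter values and function names are hypothetical.

```python
def first_passage_pmf(p, clearance, t_max):
    """f[t] = P(first run of `clearance` conforming units is completed at unit t)."""
    q = 1.0 - p
    run = [0.0] * clearance          # run[r] = P(current run length is r, not yet done)
    run[0] = 1.0
    f = [0.0] * (t_max + 1)
    for t in range(1, t_max + 1):
        f[t] = q * run[clearance - 1]        # one more conforming unit completes the run
        new = [0.0] * clearance
        new[0] = p * sum(run)                # a defective unit resets the run
        for r in range(clearance - 1):
            new[r + 1] = q * run[r]          # a conforming unit extends the run
        run = new
    return f

def z_transform(f, z):
    """Sum of f[t] * z**(-t), the transform convention assumed here."""
    return sum(ft * z ** (-t) for t, ft in enumerate(f))

def closed_form(z, p, clearance):
    q = 1.0 - p
    return q**clearance * (z - q) / (z**clearance * (z - 1.0) + p * q**clearance)

p, clearance = 0.1, 5                # illustrative values (assumed)
f = first_passage_pmf(p, clearance, 2000)
numeric = z_transform(f, 1.2)
exact = closed_form(1.2, p, clearance)
print(numeric, exact)
```

At z = 1 the closed form reduces to $q^I p / (p\,q^I) = 1$, which matches the requirement that the first-passage probabilities sum to one.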

Theorem  2.  Definition  equivalence  for  uls  phase 
Proof. 


a.  From  Theorem  2.4, 
■ Pa,SI(,:> 


SN  " °“uls 


®SI  “ f“uls 


b. 


SN  t 


+ 4L.  f. 


“uls  SN'A  “uls  SI'A 


“ ufSN,A  + ffSI,A  • fr°m  a* 


"uls , A 

c. 


h 

Juls 


~ (i — L.) 


(5^T> 


6 

2-8 


or 


Hi 


'uls 


(1  - Q 


uls, A 


fuls,A  ; 


JUl8,A 


which, along with b., finishes the proof.

Theorem 3.  Definition equivalence for ls phase.
Proof.  a.  Break ls into (SN $\vee$ SI) blocks as follows:


0 to  h-lf 


n h 


flh+l  ,to  k-li 


Since  all  the  blocks  are  structurally  equivalent,  we  have 
from  Theorem  3.3, 

U1  " Tp“  ' y2  " 7?  ' and  u3  - (eq) 

where  r ■ k - (h+1) . 

b.  In the split system, we can put the three segments
of ls first in the corresponding transitional matrix of the
embedded MC:


1 

2 

3 

a 

b 


0 

0 

0 


q 

o 

o 


3 

0 

q 

o 


T' 


T23 


t 


Then  e'T'  ■ e'  induces 


v = °1  1 
I °£  - °2  (“l1 

''e2  - °5  ) 


”l  k 

Therefore,  £ e'us  ■ ^ (1-q  ) from  (EQ)  and  (B^) . 


Therefore,  = e£w^a  and  hence 


(h) 


(qh/f)e£ 

01  o m n > 


D2+elwls 


D2  - SejUj,  j * (1,2,3) 


c.  In  order  to  get  the  necessary  relationships  between 
the  primed  and  unprimed  systems,  we  need  some  additional 
conditions  on  exits  and  entrances;  we  assume  the  usual 
"CSP-2"  type: 


Is 


■*b 


, a * sc  or  ck 
b * Is  or  uls 


1-q 


k 


Using  the  above  assumptions,  we  can  now  fill  in  the  a and 
b columns  of  T : 


12  3 

aS(l-qh,  1-q,  l-qr,  *) 

b;<  0 , 0 


qr , *>* 


where  is  the  transposed  vector  of  v.  The  resulting 
"augmented”  matrix  leads  to  the  system  (E2): 


<l-qn)e£  + pe'  + (l-qr)e'  + u - e^ 


qre^  + w 


e; 


(e2) 


72b 


! i 


i ■: 


3 


where  u - e*  • col  *,  w * e'  • col and  22i(.)  *s  derived 
from  col ^ ^ by  setting  the  first  three  components  equal  to 
zero.  Then 


(Ej^)  and  (E2) 


e'  - u « e£ {l~q) 
eb  " w " elqK 


d.  The  standard  transitional  matrix  T is 


1-q  q 


From eT = e, we get


a-q  )ela  + u - ea 
q e,„  + w - ek 


where  v * e • coll8,  u ■ e • col* , w « e • col£,  and 
coljaj  is  gotten  from  col ^ ^ by  setting  the  first  component 
equal  to  zero.  Finally,  from  (E2)  and  (E^),  we  get 


el  " v - els 
ea  " ea 
eb  " eb 


e.  Thus  from  (Eg ) and  the  last  part  of  b. , 


W Vf)«i. 

D 


i 
or


l(h) 


f.  As in Theorem 2, $\alpha_{SN_h} = u\,\alpha_h$ and $\alpha_{SI_h} = f\,\alpha_h$, which
together imply that


k-1  o 


k-1  o 


r SNh  1 . “ Sin  2 . rl. 

hio  “is  SNh'A  h-0  -is  SIh'A  h “Is  h'A’ 

g.  From  e.  and  f. 

*h,A(l)(z>  ’ 

W,<z>  - 

Therefore,  sum  for  A (2) 

' (A qh  (lrlh)  (l^)k 

yls  h-0  H 

i-  fz~u\k 

» r — { — r ■ 'r — > {^3-}k 

/ul8  f-z+u ' 'z-u' 


H 


i_  (q*  - (fa.)*,  . 


Is 


h.  Letting $x = \delta/(z-\beta)$ and $x' = (fq/(z-u))^k$, and
again using e. and f., the sum for A(1) is


k-1  . 

hio  'h'Rfl> 

= (X  - -~i(qK-x')x) 
uls 

A 

^1  M i H v 

- — - (-4^  x - q x + xx'  - x + x) 
Mls 
- — {(l-qk)  (x)  + 1)  - x + xx')

“la  ° 

A 

^1  k 

- — { (1-q  / - x(l-x')} 

“Is 

A 

• £ «U.*C1>  ,l> 


since $((z-1)/\delta) + 1 = (z-\beta)/\delta = 1/x$, which finishes
Theorem 3.

Theorem  4.  Definition  equivalence  for  ck  phase. 

Proof,  a.  Splitting  ck  into  its  MC  components  and 
using  induction  along  with  the  C-K  equations , we  have 


A 

fCj,A(b) 

A 

fCj,A(a) 

and 


i-cTi 

Jh 


> 0<j<n-i 


%,A(.) 
where  a and  b 


- TRj+ir  • 

z 

are  defined  through  the  following  diagrams 


b.  The transitional matrix T' is too bulky to write
down here, but we do order the states as: $C_0, \ldots, C_{m-1}$,
$C'_0, \ldots, C'_{m-2}$, a, b, in what follows.  From the
eigen-vector equation, we get
•6  * v


" ej+i 

0<^<n-2 


<V  ' 


p®6  " *6 

•j+POj+l  " ®j+l 
(0<^jym-3) 


(Ex)  , 


'«  ■ u + *i-l  + *i-2 


•b  ' w + 


where  o j - e 

w ■ • col* 


> <®5> 


Cj'  *5  " *ffj'  v " 1'  ' S2io»  u " 5. 


■ •'  • coir » 


(col 


and 


f tool-)  j * 
:)j  - * 

I 0 , otherwise. 

f (colb) j,  if  j + m 
L 0f  otherwise. 


(oolg)j  • 

From  (Eq ) and  (E^)  we  get 

•j  ■ *6  <l"*3+1)  1 (b3) 

•j  ’ ‘o’3  J 

But  now,  again  from  Theorem  2.4, 

■cj  ■ 


*Ej  " D*  4X14 


m-1 


m-2 


D'  - E e/  + E e/  . 
0 * o * 


where 


Again, 


<v 


/m-1  , m 

E e/  - eX  r*£L  ] 

n k 0 p J 


m-2  . m 

E e'  - [m-(-=3_)] 


Therefore  D'  - me^  . 

c.  From  the  transitional  matrix  of  the  original 
process,  we  get 


<l-qm)eck  + u - ea 
* eck  + w " eb 


ck 


(E4) 


I . 


From  (B2)  and  (E^)  , we  finally  get 


*6  " eck 
eb  " eb 
**  “ °a 


(e5) 


d.  Thus  c.  and  b. 


L-a1^ 


<l-a 


ck 


and 


*C  j 


— ) ®ck 
•'ok  CK 


e.  From  d.  and  a. , 

m-1  a , * m-1  _j  „ _ „ . 

t -Si  f . - £ 3 — (_3 -.)m(JL.)3 

j-0  °ck  cj'A(b)  j«o  •'ck  z q 

.m 


_ 1_  ,*  -1>  / q 'Hi 

“dk  ^ ’ 


Jck 


ok,A(b) 


(qW  * °ck,A(b) (1,)* 

f . Once  again  using  d.  and  a. , 

!si 


m-1  a_4  * m-2  a; 


£ MM*  f . . . . 4 £ ■— WM  . t t 

j-0  “ok  ej  ,*{«)  j.0  «ok  eT.AU, 

l SLL.  (±_JL__)  4 £ -li-S L . 4rrrr 

j-0  '“ok  V=T  j.o  "-ok  «"-(5+li 


m-1 


m-1 


_L_  <V  ,V  - + V ,3  ♦ "i*  «3+l  . Y <„,!♦»> 

•■ok*  0 0 0 


m-2 


Mckz 


. _ m-1  . m-1  4 _ m-1  a 

i-_  {(l-qm)  + E (qz)3  - E (qz)J  + (l-q®)  I *3) 
m 1 1 1 


1 ,,  _m.  ,zm-lv 

— (i  q ) (in-) 


wokz 


^(U-q")  -Q0k,A(a)«*»> 


Corollary (to Theorems 1-4).
$\alpha_s/\alpha_i$ is plan-independent.

Proof.  (clear).

5.2  Equilibrium sampling plans.  Having shown the equivalence
of the two definitions in 5.0, we can now turn our
attention to the fundamental SMC system for delayed p.d.f.'s
("*" and "—" have been replaced with a prime symbol).

Since a first transition returns the equilibrium system
to the ordinary non-delayed one, we have:


where 


(F.S.)f  P£fk(t)  - + <3i,k)Jk(t) 


Ji  - Ho'1-8k>  • 


For this system we have
Lemma.

. . « H.  A 

Jk  -■  Ho  - -iC  Jk  ■ 


Proof . 

a.  i;  - • 


b.  But , 

A 1 A 

M — -T 

ukm  zy^  k,m 


m°*'m  ’ Jk  * 


o.  Thus,  since  HQ/*  ■ H^,  we  are  done  by  a.  and  b. 

Theorem 5.  (Stationarity)  Given a CSP,
$\alpha = (\alpha_1, \ldots, \alpha_n)$ is a stationary distribution for
$P'_{i,j}(t)$.


Proof. 

a.  Statement  of  Theorem  is  equivalent  to 
j ajpjrk^  “ ak'  lt>0) 


or 


E a.P 


o,.H, 


j j,k  k 0 * 
b.  L.H.S. of last equation in a.

“II  O i()j  -P*  «_  * ~ 

j 8 i j'*  *'k  + “kJk 
« wx  + W2,  for  (F.S.)' 

e.  W1  ‘ ' “J  ' °j,i>P.,k 

■ ] Jb16!"1!,.  - 

a 

■ IT  (J  {*  *J<Ij.1P«k 

- * *j  <J  Oj.p.K>  ♦ "‘jX-A*  * Vkl 

(tha  last  two  terms  summing  to  zaro  slnca  la j - 

A 

' B1  « Vik  - * ejpjk  + V»» 

a 

«1  - 
" IT  Vk 


"l 

" yj  °kJk  • 
d.  By the lemma,


W2  - “kJk 


a H1  a 

■ »k'K0  - il  Jk> 


J 


c.  and  d.J 


W1  + W2  " °kH0 


which completes the proof.

Defining $N_k(t)$ as the number of recurrences of the
state k in (0, t], we have an analogous result concerning
the stationarity of the renewal functions

Theorem  6. 

VV*”  - ■ 

Proof. 

a.  Ea[Mk(t)]  - £ 

sov  hk  - + 

«k"o  - <t“Ak>  (1-0k»  + «k3k 


or 


“k<Ho-Jk> 


1-Ql 


r 0iR£k  . 


b.  But  J'  - H0  - HgQ^ 


••  H0  “ Jk  " H0"H0+H0Qk 


- H0Qk 


«0H1 

-Sr  (1^> 
c.  b. and a. assays fl|[N|C(t) ] " (jp) » latting

a (*)  * HqH^ , we  have 

a(n)  - /'  ■ n 

(z-1)2 

- (~)n. 


5.3  Variance for CSP-1 and FI(N).  We conclude this chapter
with an application of Theorem 5 (and Theorems 1 through 4)
in obtaining an upper bound to the second moment of (1-FI(N))
for CSP-1.  To avoid repetition of Chapter 4, we split uls
into its two nondegenerate subphases, thereby dealing with
an SMC with 3 states.  Analogous methods can be applied to
CSP-2 as well.  Letting S(N) = 1-FI(N), we have


4 


(I10*PSNfSN)  (N* 


*SN 


N 


< 4-  « 


SN 


Ho(N)  ,Vpsn,sn(n> 

H ( SgASS 


since $H_0$ and $H_0 * P_{SN,SN}$ are monotonically nondecreasing;

1 fl,  1 % fH0*P8M.8H(M,j 

" T“  °SN  <1+  nr>  ( ft  — ) • 


Thus 


, 1 Hq*Psn  sn(N) 

Vls(H>>  1 « «SNa+  -h~) ( sm'8N — > 


bbnme  j 

TT*  Eo} 


But  PSN,SN  “ QSN,SIPSI,SN  + JSN 

A A A A A 

PSIf SN  “ QSI,SNPSN,SN  + °SI,1P1,SN 


; . Qsn.siQsi^i.sn^sn 

SN,SN  n_5  A ) 

U WSN,SIUSI,SN' 


(QSN,SIQSI,1P1,SN)S  + JSNS 


" A P1,SN  4 B * 


Now,  simplifying 

k'z>  * TTi^ST 

which  implies 

vA<n>  - £S " e<n  11 ; s = o <e2> 

and  A 

$$B(z) = \frac{z - fq}{z - \beta}$$

which implies

$$H_0 * B(n) = 1 + \frac{u}{fp}\,(1 - \beta^n). \qquad (E_3)$$
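The step from a transform of the form $(z - fq)/(z - \beta)$ to its running sum can be checked by expanding in powers of $1/z$. The sketch below assumes $\beta = 1 - fp$, $u = 1 - f$ and $q = 1 - p$ (a reading of the notation, not confirmed by the scan), takes $H_0 * B(n)$ to mean the partial sum of the inverse transform up to n, and uses illustrative values of f and p.

```python
def inverse_transform_coefficients(f, p, n_max):
    """Coefficients b[n] of (z - f*q) / (z - beta) expanded in powers of 1/z."""
    q = 1.0 - p
    beta = 1.0 - f * p               # assumed meaning of beta
    # (1 - f*q*w) / (1 - beta*w) with w = 1/z: b[0] = 1, b[1] = beta - f*q,
    # and b[n] = beta * b[n-1] for n >= 2.
    b = [1.0]
    for n in range(1, n_max + 1):
        correction = -f * q if n == 1 else 0.0
        b.append(beta * b[n - 1] + correction)
    return b

def running_sum_closed_form(f, p, n):
    """Assumed closed form 1 + (u / (f*p)) * (1 - beta**n)."""
    u = 1.0 - f
    beta = 1.0 - f * p
    return 1.0 + (u / (f * p)) * (1.0 - beta**n)

f, p = 0.2, 0.05                     # illustrative sampling fraction and defect rate
b = inverse_transform_coefficients(f, p, 200)
partial_sums = [sum(b[: n + 1]) for n in range(201)]
print(partial_sums[10], running_sum_closed_form(f, p, 10))
```

The first coefficient after b[0] equals $\beta - fq = 1 - f = u$, so the partial sum at n = 1 is $1 + u$, in agreement with the closed form.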


(Eq),  (Ex),  (E2),  and  (E3) 


_ 2 v . _ . 1 .,VA*P1.SN<N>  . VB(N\  “SN 

EaI(S(N))  ) < agN(l  + -jp)< + g ) - -jp 


„ . 1 * <VP1,SN<N>>  . V8(«,  °SN 

- “SN(1  + — H(H0*A(N))  > - -|p 

“ Ogjj(l  + — jjp)  { (1-0**  ^)(1-API(N))  + — jjp  (1+— j— (1-0^) ) } - -jp 


6.0  CONCLUDING REMARKS.  We conclude this paper with two
examples of the direct use of SMC theory followed by a
short summary.

6.1  CSP-1 in tandem.  Consider two CSP-1 plans arranged
in tandem; i.e., the output of the first is the input of
the second.  This kind of sampling procedure (along with
further iterations) can practically arise when each production
unit is being inspected for 2 (or more) defects.
An example of what is involved in a two dimensional MC
model of this situation is now given.

For  0 < jx(j2)  < “ 1<I2-1>' 

(H(jx+1),  H(j2+1)) 


(HO,  Hj2). 


•(Hj1,  H j 2 ) 


(H  ( j.  +1)  , HO)  . 


Upon working out all the remaining transitional probabilities,
it quickly becomes clear that such a model is time homogeneous.


Let us now consider collapsing the two dimensional
model into a one dimensional one; the result is non-
Markovian (the reverse process, constructing a MC model
out of a non-Markovian one through additional variables,
thereby yielding higher dimensional states, is called the
method of supplementary variables).  Specifically, we
obtain a non-homogeneous SMC; for instance, during the
time interval (k, k+1), $j \to j+1$ with probability
PT»  jn(k»k+1)  “ t*il*k>qi  + P12  (k)  0l)q2 

where the first factor is derived from the first plan.
However, if we consider the first CSP to be steady state,
the result is a time homogeneous SMC; for instance,


QT,3+r(n)  ’ 


q^qj,  n>l 


telal+0lcl2><32'  n-1 


6.2  Downstream inspection.  Another example of the direct
use of SMC techniques is downstream inspection in a CSP-1
setting: if upon inspecting, a defect is found in the
uls phase, go to an intermediate one and inspect, at 100%,
the $l$ previous units; if no defects are found, transfer
back to uls; otherwise go to sc; then proceed as in CSP-1.
This modified CSP-1 can be modeled directly with the
following SMC without the intermediate stage; the model
has the following SMC transitional diagram:


[SMC transitional diagram for the modified CSP-1; not recoverable from the scan]

with 


S2(l(nl 

a2(2(n) 


Bn_1m  (l-q1) 


8n_1<8) (q1). 


It should be noted that if sampling downstream were instead
sampling upstream, we would essentially have a
"partial" CSP-3 since operational time is measured by
the flow of production units, each counted once!

6.3  Summary.  A simplified method, with some of its
ramifications and variations, of dealing with the
standard MC model of a given CSP has been considered.

The essence of the technique is the partitioning of
the MC into naturally defined segments.  This blocking
out of (relatively) many microstates into (relatively) few
macrostates has been accomplished here within the
natural context of SMC's.  However, this approach does
not obviate the need for the MC model in favor of some
directly given SMC since the former is initially likely
to be the more intuitive and easier of the two to construct.
For a more practical explication of the basic
method for the steady state case not explicitly involving
SMC's, references 7.3 through 7.5 are highly recommended
(where the method is called "A Simplified Markov Chain
Approach").
TRACKING RELIABILITY GROWTH


Larry  H.  Crow 

U.  S.  Army  Materiel  Systems  Analysis  Activity 
Aberdeen  Proving  Ground,  Maryland 

ABSTRACT.  It is common practice for a complex system under
development to be subjected to a test-fix-test-fix process.  During this
process, the system is tested until a failure occurs, design and/or
engineering modifications are then made as attempts to eliminate the
failure mode(s), and the system is tested again.  This process is continued
until the desired reliability is attained.  Because of these changes in
reliability and the fact that test data may be limited in quantity, it is
often a difficult task to directly estimate the growth of reliability and
relate this to the final reliability goal.

A popular,  "common  sense"  procedure  used  for  the  tracking  of 
reliability  is  not  satisfying  because  of  several  major  drawbacks.  This 
paper  gives  improved,  yet  simple,  techniques  for  tracking  the  system 
reliability  through  this  development  process,  along  with  appropriate 
confidence  bound  and  goodness  of  fit  procedures.  Application  of  these 
techniques  to  an  Army  system  is  discussed. 

1.  INTRODUCTION 

Invariably, development programs for sophisticated, complex
systems require considerable resources, such as time, dollars and manpower,
to achieve a level of system reliability acceptable to the
user.  The reliability requirements for many systems are high, and to
obtain these high goals it is common practice to subject the system to
a test-fix-test-fix process.  During this process, the total system or
major subsystems are tested to failure, system failure modes are
determined, and design and/or engineering changes are made as attempts
to eliminate these modes or, at least, to decrease their rate of
occurrence.  If this process is continued, and design and engineering
modifications are made in a competent manner, then the system reliability
will increase.

It is advantageous, of course, for the program manager to
track this increase in system reliability during the development program.
He may then determine as early as possible whether or not the
system reliability is growing at a sufficient rate to meet the
required goal and allocate available resources accordingly.  In this
regard, a program manager wishes to determine from test data the
current reliability status of the system, estimate the rate of growth,
and obtain projections of future expected reliability.

Since  the  system  configuration  is  continually  changing  under 
this  test-fix  process,  there  is  usually  limited  test  data  available  on 
the  system  for  a fixed  configuration.  Consequently,  direct  estimates 
of  system  reliability  for  a fixed  configuration  would  generally  not 
enjoy a high degree of confidence and may, therefore, have little
practical value.

Because  of  these  difficulties  with  the  direct  estimation  of 
system  reliability,  mathematical  reliability  growth  models  are  often 
employed.  Most  reliability  growth  models  considered  in  the  literature 
assume  that  a mathematical  formula  (or  curve),  as  a function  of  time, 
represents the reliability of the system during the development program.
The central purpose of most reliability growth models includes
one or both of the following objectives:

a.  Inference  on  the  present  system  reliability, 

b.  Projection  on  the  system  reliability  at  some 
future  development  time. 

Many  reliability  growth  models  are  parametric.  That  is, 
these  models  have  certain  parameters  which  are  unknown  and  must  be 
estimated  from  test  data  generated  during  the  development  program. 

This paper considers a popular parametric reliability growth model
which is widely used in government and industry.  Background on the
derivation of the model will be discussed along with some major drawbacks
with a "common sense" technique for estimating the unknown
parameters.  We show how these drawbacks can be avoided by applying
estimation, goodness of fit and confidence interval procedures developed
at AMSAA.  Recently developed tables for computing exact confidence
intervals on system failure rate and MTBF are given and an
application of these techniques to an actual Army development program
is discussed.


2.  THE  WEIBULL  RELIABILITY  GROWTH  MODEL 

In  1962,  J.  T.  Duane  of  General  Electric  Company's  Motor 
and  Generator  Department  [see  Duane  (3)]  published  a report  in 
which  he  presents  his  observations  on  failure  data  for  five  divergent 
types  of  systems  during  their  development  programs  at  G.  E.  These 
systems  included  complex  hydromechanical  devices,  complex  types  of 
aircraft  generators  and  an  aircraft  jet  engine.  The  study  of  the 
failure  data  was  conducted  in  an  effort  to  determine  if  any  systematic 
changes  in  reliability  occurred  during  the  development  programs  for 
these  systems.  His  analysis  revealed  that  for  these  systems,  the 
observed  cumulative  failure  rate  versus  cumulative  operating  hours 
fell close to a straight line when plotted on log-log paper.  Similar
plots have been noted in industry for other types of systems, and by
the U. S. Army for various military weapon systems during development
[see Crow (2)].


For a mathematical interpretation of these straight line
plots on log-log paper, let $N(t)$ denote the number of system failures
by time t, t > 0.  The observed cumulative failure rate $C(t)$ at time t
is, therefore, equal to $C(t) = N(t)/t$.  The plots on log-log paper
imply that $\log C(t)$ is approximately a straight line.  That is,
$\log C(t) = \delta + \gamma \log t$.  Equating $C(t)$ to its expected value and
assuming an exact linear relationship, we have $\log(E[C(t)]) =
\delta + \gamma \log t$.  Taking exponentials gives $E[C(t)] = \lambda t^{\gamma}$, $\lambda = e^{\delta}$.
Hence, $E[N(t)] = \lambda t^{\beta}$, for $\beta = \gamma + 1$, since $E[C(t)] = E[N(t)]/t$.  Thus,
the expected number of system failures by time t is $\lambda t^{\beta}$.
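The log-log relationship above can be illustrated with a few lines of arithmetic. The sketch assumes the exact model $E[N(t)] = \lambda t^{\beta}$; the parameter values and function names are illustrative and are not taken from the paper.

```python
import math

def expected_failures(t, lam, beta):
    """E[N(t)] = lam * t**beta under the assumed power-law model."""
    return lam * t**beta

def expected_cumulative_rate(t, lam, beta):
    """E[C(t)] = E[N(t)] / t = lam * t**(beta - 1)."""
    return expected_failures(t, lam, beta) / t

lam, beta = 0.5, 0.6                 # illustrative values (assumed)
t1, t2 = 10.0, 1000.0

# Slope and intercept of log E[C(t)] against log t, from two points.
y1 = math.log(expected_cumulative_rate(t1, lam, beta))
y2 = math.log(expected_cumulative_rate(t2, lam, beta))
slope = (y2 - y1) / (math.log(t2) - math.log(t1))
intercept = y1 - slope * math.log(t1)

print(slope, beta - 1)               # slope plays the role of gamma = beta - 1
print(intercept, math.log(lam))      # intercept plays the role of delta = log(lam)
```

A negative slope, as here, corresponds to a decreasing cumulative failure rate, i.e. reliability growth.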

The instantaneous failure rate, $r(t)$, of the system is the
change per unit time of $E[N(t)]$.  Thus, $r(t) = \frac{d}{dt} E[N(t)] = \lambda\beta t^{\beta-1}$,
which is recognized as being the Weibull failure rate function.  It is
important to note that since the system configuration is changing, the
data are not homogeneous and, therefore, the usual theory for a
Weibull distribution will not apply.  In fact, it has been shown by
the author [see Crow (1)] that when the configuration of the system
is changing, and failures are governed by the failure rate
$r(t) = \lambda\beta t^{\beta-1}$, then the system failure times follow a nonhomogeneous
Poisson process with Weibull intensity function $r(t)$.
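A nonhomogeneous Poisson process with this intensity is easy to simulate, which gives a feel for the data the model describes. The sketch below uses the standard time-transformation method: if $\Gamma_1 < \Gamma_2 < \ldots$ are the arrival times of a unit-rate Poisson process, then $(\Gamma_i/\lambda)^{1/\beta}$ are arrival times of a process with mean function $\lambda t^{\beta}$. The parameter values, the seed, and the function name are illustrative assumptions.

```python
import random

def simulate_failure_times(lam, beta, T, rng):
    """Failure times in (0, T] of an NHPP with mean function lam * t**beta."""
    times = []
    gamma = 0.0
    while True:
        gamma += rng.expovariate(1.0)            # next unit-rate Poisson arrival
        t = (gamma / lam) ** (1.0 / beta)        # invert the mean function
        if t > T:
            return times
        times.append(t)

rng = random.Random(12345)                       # fixed seed for repeatability
lam, beta, T = 0.5, 0.6, 1000.0                  # illustrative values (assumed)

counts = [len(simulate_failure_times(lam, beta, T, rng)) for _ in range(2000)]
mean_count = sum(counts) / len(counts)
expected = lam * T**beta                         # E[N(T)] under the model
print(mean_count, expected)
```

With $\beta < 1$ the simulated failures bunch up early in the test and thin out later, which is the pattern a successful test-fix-test-fix program is expected to show.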

At time $t_0$ the Weibull failure rate is
$r(t_0) = \lambda\beta t_0^{\beta-1}$. If no further system improvements
are made after time $t_0$, then it is reasonable to assume that the
failure rate would remain constant at the value $r(t_0)$ if testing were
continued. In particular, if the system were put into production with
the configuration fixed as it was at time $t_0$, then the life
distribution of the systems produced would be exponential with mean
time between failure (MTBF)
$M(t_0) = [r(t_0)]^{-1} = t_0^{1-\beta}/(\lambda\beta)$. Hence, for
$0 < \beta < 1$, the MTBF $M(t)$ increases as the development testing time
$t$ increases, and is proportional to $t^{1-\beta}$. Thus, $\beta$ is a growth
parameter reflecting the rate at which reliability, or MTBF, increases
with development testing time.
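
The three functions of the model can be written down directly. The following is a minimal sketch; the function names and the numerical values of $\lambda$ and $\beta$ are hypothetical and chosen only to make the arithmetic easy to check.

```python
def expected_failures(t, lam, beta):
    """E[N(t)] = lam * t**beta."""
    return lam * t ** beta

def failure_rate(t, lam, beta):
    """r(t) = lam * beta * t**(beta - 1), the derivative of E[N(t)]."""
    return lam * beta * t ** (beta - 1.0)

def mtbf(t, lam, beta):
    """M(t) = 1 / r(t) = t**(1 - beta) / (lam * beta)."""
    return 1.0 / failure_rate(t, lam, beta)

# Hypothetical parameter values (not from the paper).
lam, beta = 0.5, 0.5
mtbf_at_100 = mtbf(100.0, lam, beta)
# Quadrupling the test time multiplies the MTBF by 4**(1 - beta).
growth_ratio = mtbf(400.0, lam, beta) / mtbf(100.0, lam, beta)
```

For these values the MTBF at $t = 100$ is 40 and the ratio is $4^{0.5} = 2$, in line with $M(t)$ being proportional to $t^{1-\beta}$.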

If this Weibull model is determined to sufficiently represent
the occurrence of failures for a particular system during development
testing, then it can, of course, be used to monitor and project the
growth of system reliability. To do this, however, would require
estimating from test data the two unknown parameters $\lambda$ and $\beta$
by, say, $\hat\lambda$, $\hat\beta$. One would then estimate the failure rate
function by $\hat r(t) = \hat\lambda\hat\beta t^{\hat\beta-1}$ and the MTBF
function by $\hat M(t) = [\hat r(t)]^{-1} = t^{1-\hat\beta}/(\hat\lambda\hat\beta)$.
If the system is tested to time $T$, say, then $\hat M(T)$ would
estimate the current MTBF, and $\hat M(t)$, $t > T$, would project estimates of
system MTBF into the future.

Consider a "common sense," often used procedure for estimating
$\lambda$ and $\beta$. Suppose the system is tested to time $T$, and let
$0 < T_1 < T_2 < \ldots < T_K = T$ be a partition of $(0, T]$. The observed
cumulative failure rate at time $T_i$ is $C(T_i) = N(T_i)/T_i$, where $N(T_i)$ is
the number of system failures to time $T_i$, $i = 1, \ldots, K$. Recall that
$\log E[C(T_i)] = \log\lambda + (\beta-1)\log T_i$. Hence, if we plot $\log C(T_i)$
versus $\log T_i$ on coordinate paper and fit a line by linear regression,
we could use $\hat\gamma$, the slope, to estimate $\beta - 1$ and $\hat\delta$, the
intercept at $t = 1$, to estimate $\log\lambda$. The estimates of $\lambda$ and $\beta$
would be $\hat\lambda = e^{\hat\delta}$, $\hat\beta = \hat\gamma + 1$, respectively.
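
The plotting procedure just described amounts to an ordinary least squares fit on the log-log scale. A minimal sketch follows; the partition and the noise-free failure counts are hypothetical, constructed from $\lambda = 0.8$ and $\beta = 0.6$ so that the fit can be checked by hand, and the function name is our own.

```python
import math

def fit_loglog(T, counts):
    """Least-squares line through the points (log T_i, log C(T_i)), where
    C(T_i) = N(T_i) / T_i. Returns (lam_hat, beta_hat), i.e. the exponential
    of the intercept at t = 1 and the slope plus one."""
    xs = [math.log(t) for t in T]
    ys = [math.log(n / t) for t, n in zip(T, counts)]
    k = len(xs)
    xbar = sum(xs) / k
    ybar = sum(ys) / k
    sxx = sum((x - xbar) ** 2 for x in xs)
    sxy = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = ybar - slope * xbar   # value of the fitted line at log t = 0
    return math.exp(intercept), slope + 1.0

# Hypothetical partition with idealized (noise-free, non-integer) counts.
T = [100.0, 200.0, 400.0, 800.0, 1600.0]
counts = [0.8 * t ** 0.6 for t in T]
lam_hat, beta_hat = fit_loglog(T, counts)
```

With real counts the result changes when the partition changes, which is the first objection raised in the next paragraph.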

There are several points to be made about the above
technique for estimating $\lambda$ and $\beta$. Firstly, the estimates are
dependent on the choice of $T_i$, $i = 1, \ldots, K$, and, of course, may differ
for different choices. Thus, this method is subjective, yielding
results perhaps not susceptible to rigorous analysis. Secondly, the
values $C(T_i)$, $i = 1, \ldots, K$, are not independent since $N(T_i) \le N(T_j)$ for
$i < j$. Moreover, the variances of the $C(T_i)$'s are not equal. In
particular, $\mathrm{Var}[C(T_i)] = \lambda T_i^{\beta-2}$. If the system reliability is
improving ($0 < \beta < 1$), then $\mathrm{Var}[C(T_i)]$ is decreasing as $T_i$ increases.
Hence, since the $C(T_i)$'s are not independent with equal variances,
usual normal regression theory will not apply to yield confidence
bounds on the parameters $\lambda$, $\beta$, and the functions $r(t)$, $M(t)$. Finally,
in practice, the criteria for using the Weibull model and this
estimation technique would probably depend on the subjective appraisal
of whether or not the plotted points appear to lie nearly on a
straight line.




It is apparent that improved goodness of fit, estimation
and confidence bound procedures are needed for this highly important
task of monitoring and projecting the growth of system reliability
during development. Using the result that the plots on log-log paper
imply that the successive failure times of the system follow a certain
stochastic process (i.e., the nonhomogeneous Poisson process with
Weibull intensity $\lambda\beta t^{\beta-1}$) we have derived a variety of useful
statistical procedures for this model. Some recent results will be
discussed in the following sections.


3.  ESTIMATION AND GOODNESS OF FIT PROCEDURES

If the successive times of failures are being recorded for
a system undergoing development testing, then a statistical goodness
of fit test can be performed to determine if the Weibull reliability
growth model is appropriate. If the model is acceptable, then closed
form maximum likelihood (ML) estimates of $\lambda$ and $\beta$ may be used to
estimate and project system MTBF. Using these procedures developed by
the author in (1), one can avoid the aforementioned drawbacks
associated with estimation from log-log plots.

Suppose that a system has experienced $N$ failures during
development testing. Let $X_i$ be the age (time on test) of the system
at the i-th failure, $i = 1, \ldots, N$. If testing is stopped at the N-th
failure time, the data are said to be failure truncated.


The ML estimate of $\beta$, the growth parameter, is

(3.1)    $\hat\beta = \dfrac{N}{\sum_{i=1}^{N} \log (X_N / X_i)}$

and the ML estimate of $\lambda$ is

(3.2)    $\hat\lambda = \dfrac{N}{X_N^{\hat\beta}}$.


Thus, calculating $\hat\lambda$, $\hat\beta$ one may estimate the failure rate
function $r(t) = \lambda\beta t^{\beta-1}$ by $\hat r(t) = \hat\lambda\hat\beta t^{\hat\beta-1}$.
The MTBF function $M(t) = [r(t)]^{-1}$ is similarly estimated by
$\hat M(t) = [\hat r(t)]^{-1}$. In particular, the current estimate of the MTBF
is $\hat M(X_N) = X_N/(N\hat\beta)$, and $\hat M(t)$, $t > X_N$,
projects expected future growth of system MTBF.
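
The closed forms (3.1) and (3.2) are derived in (1). As a sketch of where they come from, assume the standard likelihood of a nonhomogeneous Poisson process with intensity $\lambda\beta t^{\beta-1}$ observed up to the N-th failure; the steps below are a reconstruction under that assumption, not a quotation from (1).

```latex
\begin{align*}
L(\lambda,\beta) &= \Bigl[\prod_{i=1}^{N} \lambda\beta X_i^{\beta-1}\Bigr]
                    \exp\bigl(-\lambda X_N^{\beta}\bigr),\\
\log L &= N\log\lambda + N\log\beta + (\beta-1)\sum_{i=1}^{N}\log X_i
          - \lambda X_N^{\beta},\\
\frac{\partial \log L}{\partial\lambda} &= \frac{N}{\lambda} - X_N^{\beta} = 0
   \;\Longrightarrow\; \hat\lambda X_N^{\hat\beta} = N,\\
\frac{\partial \log L}{\partial\beta} &= \frac{N}{\beta} + \sum_{i=1}^{N}\log X_i
   - \lambda X_N^{\beta}\log X_N = 0
   \;\Longrightarrow\; \hat\beta = \frac{N}{\sum_{i=1}^{N}\log (X_N/X_i)} .
\end{align*}
```

The last step substitutes $\hat\lambda X_N^{\hat\beta} = N$ from the first condition; the term $i = N$ of the sum is zero, so the sum may equally be taken to $N - 1$.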

To determine the appropriateness of the Weibull model for
representing the reliability growth for this system, one may calculate
the statistic

(3.3)    $C_M^2 = \dfrac{1}{12M} + \sum_{i=1}^{M} \left[ \left( \dfrac{X_i}{X_N} \right)^{\bar\beta} - \dfrac{2i-1}{2M} \right]^2$

where $M = N - 1$, $\bar\beta = [(M-1)/N]\hat\beta$. Critical values of the $C_M^2$ statistic
for M = 2 thru 60 have been determined at AMSAA from Monte Carlo
simulation, using 15,000 samples for each value of M. Various
critical values are given in Table 2 of (1).

If the statistic $C_M^2$ is greater than the selected critical
value, then the Weibull model is rejected at the designated
significance level. If $C_M^2$ is less than this critical value, then the Weibull
model is accepted and may be used to track the system reliability
growth.


Suppose that $K \ge 1$ systems have been simultaneously tested
to time $T$, where $T$ is not a failure time. In this case the data are
time truncated. If design and engineering modifications are made on
all K systems at the same time, then at any time during the testing
the systems will have basically the same configuration. In this
situation, we may combine the failure data on these K systems to
obtain estimates of $\lambda$ and $\beta$. These estimates and other related
procedures are given in (2).


4.  CONFIDENCE  BOUNDS  FOR  MTBF 

In  this  section  we  shall  give  recently  developed  procedures 
for  placing  confidence  bounds  on  current  and  projected  failure  rates 
and  MTBF.  These  procedures  apply  to  the  single  system,  failure 
truncated situation. Similar developments for time truncated
testing will appear in a future AMSAA report when completed.

If a system undergoes development testing until the N-th
failure occurs, then $r(X_N)$ [$M(X_N)$] is the current failure rate [MTBF].
It can be shown that the ratio $U_N = N r(X_N)/[(N-1)\hat r(X_N)]$ is distributed
independently of $\lambda$ and $\beta$, where $\hat r(X_N)$ is the ML estimate
$\hat\lambda\hat\beta X_N^{\hat\beta-1}$ of $r(X_N)$. Percentage points of this ratio
were obtained at AMSAA from Monte Carlo simulation for N = 2 thru 60.
These percentage points are presented in Table 1. Exact $100(1-\alpha)$
percent confidence bounds on $r(X_N)$ are of the form
$[\hat r(X_N)\,a(N-1)/N,\ \hat r(X_N)\,b(N-1)/N]$, where $a$ and $b$ are from
Table 1 such that $\mathrm{Prob}(a < U_N < b) = 1-\alpha$. Equivalently,
$100(1-\alpha)$ percent confidence bounds on $M(X_N) = [r(X_N)]^{-1}$ are of the
form $([\hat r(X_N)\,b(N-1)/N]^{-1},\ [\hat r(X_N)\,a(N-1)/N]^{-1})$.

For $N > 60$, $100(1-\alpha)$ percent confidence bounds may be
calculated from the approximate relationships:
$a \approx 1 - \sqrt{2/N}\, Z_{\alpha/2}$, $b \approx 1 + \sqrt{2/N}\, Z_{\alpha/2}$,
where $Z_{\alpha/2}$ is the $\alpha/2$-th percentile for the
standard normal distribution.
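
A minimal sketch of the large-sample approximation follows. Two readings are assumed here and should be checked against the original report: the multiplier is taken to be $\sqrt{2/N}$, and the normal quantile is taken as the upper $\alpha/2$ point, so that $a < 1 < b$. The function name is our own.

```python
import math
from statistics import NormalDist

def approx_percentage_points(n, alpha):
    """Approximate (a, b) with Prob(a < U_N < b) = 1 - alpha for large n,
    using a ~ 1 - sqrt(2/n) * z and b ~ 1 + sqrt(2/n) * z."""
    z = NormalDist().inv_cdf(1.0 - alpha / 2.0)   # upper alpha/2 point (assumed)
    half_width = math.sqrt(2.0 / n) * z
    return 1.0 - half_width, 1.0 + half_width

# Hypothetical case: N = 100 failures, 90 percent confidence.
a, b = approx_percentage_points(n=100, alpha=0.10)
```

As a rough plausibility check, the same formula at N = 40 and 90 percent gives about (0.63, 1.37), reasonably close to the values a = 0.664, b = 1.40 quoted from Table 1 in the numerical example.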

For N moderately large, we may also use the percentage
points in Table 1 to place approximate confidence bounds on future
failure rates and MTBF. In particular, suppose we wish to place
approximate $100(1-\alpha)$ percent confidence bounds on $r(T)$, $T > X_N$.
These approximate confidence bounds will again be of the form
$[\hat r(T)\,a(N-1)/N,\ \hat r(T)\,b(N-1)/N]$, where $\hat r(T) = \hat\lambda\hat\beta T^{\hat\beta-1}$
is the ML estimate of $r(T)$, and $a$ and $b$ are the appropriate percentage
points from Table 1. Approximate $100(1-\alpha)$ percent confidence bounds
on $M(T)$, the MTBF at time $T$, are derived, as before, from the bounds on $r(T)$.
These bounds become exact as $N \to \infty$.


5.  NUMERICAL EXAMPLE

Suppose that a system undergoing development testing
recorded the following 40 successive failure times: .7, 3.7, 13.2,
17.6, 54.5, 99.2, 112.2, 120.9, 151.0, 163.0, 174.5, 191.6, 282.8,
355.2, 486.3, 490.5, 513.3, 558.4, 678.1, 688.0, 785.9, 887.0, 1010.7,
1029.1, 1034.4, 1136.1, 1178.9, 1259.7, 1297.9, 1419.7, 1571.7,
1629.8, 1702.3, 1928.9, 2072.3, 2525.2, 2928.5, 3016.4, 3181.0,
3256.3. That is, the system was of age .7 when the first failure
occurred, of age 3.7 when the second failure occurred, etc. At age
3256.3 the system had the 40-th failure. From these data, and
equations (3.1) and (3.2) we find that $\hat\lambda = 0.761$, $\hat\beta = 0.490$.

To determine if the Weibull model may be used to track this
system's reliability growth, we calculate the goodness of fit
statistic $C_M^2$ given by equation (3.3) where $M = 39$,
$\bar\beta = (38/40)\hat\beta = 0.463$. This gives $C_{39}^2 = 0.077$. Next, we find in
Table 2 of (1) that for M = 39, the critical value at the .05
significance level is 0.218. Since $C_{39}^2 < 0.218$, we accept the
Weibull model.

Using $\hat\lambda$, $\hat\beta$, the failure rate function is estimated by
$\hat r(t) = \hat\lambda\hat\beta t^{\hat\beta-1}$ and the MTBF function is estimated by
$\hat M(t) = [\hat r(t)]^{-1}$. The current failure rate $r(3256.3)$ is estimated
to be $\hat r(3256.3) = 0.006$, and the estimate of current MTBF is
$[.006]^{-1} = 166.7$.
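
The computations of this example can be reproduced with a short script. The sketch below implements the ML estimates for failure truncated data, the current MTBF, and the goodness of fit statistic in the form $C_M^2 = 1/(12M) + \sum_{i=1}^{M} [(X_i/X_N)^{\bar\beta} - (2i-1)/(2M)]^2$ with $\bar\beta = [(N-2)/N]\hat\beta$; that form is our reading of equation (3.3) and of the factor 38/40 used above, and the function names are our own.

```python
import math

# The 40 successive failure times listed in this section.
failure_times = [
    0.7, 3.7, 13.2, 17.6, 54.5, 99.2, 112.2, 120.9, 151.0, 163.0,
    174.5, 191.6, 282.8, 355.2, 486.3, 490.5, 513.3, 558.4, 678.1, 688.0,
    785.9, 887.0, 1010.7, 1029.1, 1034.4, 1136.1, 1178.9, 1259.7, 1297.9,
    1419.7, 1571.7, 1629.8, 1702.3, 1928.9, 2072.3, 2525.2, 2928.5, 3016.4,
    3181.0, 3256.3,
]

def ml_estimates(x):
    """ML estimates (lam_hat, beta_hat) for failure truncated data x,
    where x is sorted and x[-1] is the age at the last failure."""
    n = len(x)
    beta_hat = n / sum(math.log(x[-1] / xi) for xi in x)
    lam_hat = n / x[-1] ** beta_hat
    return lam_hat, beta_hat

def cramer_von_mises(x, beta_bar):
    """Goodness of fit statistic computed from the first M = N - 1 failures."""
    m = len(x) - 1
    total = 1.0 / (12.0 * m)
    for i, xi in enumerate(x[:m], start=1):
        total += ((xi / x[-1]) ** beta_bar - (2 * i - 1) / (2.0 * m)) ** 2
    return total

n = len(failure_times)
lam_hat, beta_hat = ml_estimates(failure_times)
beta_bar = (n - 2) / n * beta_hat            # adjusted estimate used in the test
c2 = cramer_von_mises(failure_times, beta_bar)
current_rate = lam_hat * beta_hat * failure_times[-1] ** (beta_hat - 1.0)
current_mtbf = 1.0 / current_rate            # equals X_N / (N * beta_hat)
```

The estimates should agree with the values 0.761 and 0.490 quoted above to the precision shown. The statistic and the MTBF may differ slightly from the printed 0.077 and 166.7, since the text works with rounded intermediate values.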

To place 90 percent confidence bounds on the current MTBF
$M(3256.3)$, we refer to Table 1, N = 40, and find a = 0.664, b = 1.40.
Using the formulas in the previous section, we get 90 percent
confidence bounds (0.004, 0.008) for $r(3256.3)$. Hence, 90 percent
confidence bounds on $M(3256.3)$ are (125.0, 250.0).

Suppose we wish to place approximate 90 percent confidence
bounds on future MTBF, say at T = 4000. Using $\hat r(4000) = 0.005$, we
calculate these bounds to be (0.003, .007). Approximate confidence
bounds on $M(4000)$ are, therefore, (142.8, 333.3).
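
The bounds in this example follow mechanically from the percentage points. A minimal sketch, using the rounded estimates and the Table 1 values quoted in this section (the function names are our own):

```python
def rate_bounds(rate_hat, a, b, n):
    """Bounds [rate_hat*a*(n-1)/n, rate_hat*b*(n-1)/n] on the failure rate."""
    scale = (n - 1) / n
    return rate_hat * a * scale, rate_hat * b * scale

def mtbf_bounds(rate_hat, a, b, n):
    """Bounds on the MTBF, the reciprocals of the failure rate bounds."""
    low, high = rate_bounds(rate_hat, a, b, n)
    return 1.0 / high, 1.0 / low

# Values quoted in the text: ML estimates, N = 40, and the 90 percent
# percentage points a, b read from Table 1.
lam_hat, beta_hat, n = 0.761, 0.490, 40
a, b = 0.664, 1.40

def rate_estimate(t):
    return lam_hat * beta_hat * t ** (beta_hat - 1.0)

r_now = rate_estimate(3256.3)     # current failure rate estimate
r_4000 = rate_estimate(4000.0)    # projected failure rate estimate
r_lo, r_hi = rate_bounds(r_now, a, b, n)
m_lo, m_hi = mtbf_bounds(r_now, a, b, n)
p_lo, p_hi = rate_bounds(r_4000, a, b, n)
```

Because the text rounds the failure rates to three decimals before inverting, the MTBF bounds computed without rounding differ somewhat from the printed intervals.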


6.  APPLICATION 

In this section we shall discuss an application of the
Weibull reliability growth procedures to an Army development program.
Two major points concerning the application of this model are
demonstrated. Firstly, the model may be applied to discrete data.
Secondly, as in any mathematical model, care should be exercised in
its use. In particular, the importance and usefulness of the goodness
of fit statistic in Section 3 is demonstrated in this application.

Recently, AMSAA conducted a reliability growth study of a
missile system. The purpose of the study was to use historic data on
the first 801 valid flight tests to determine the growth curve, and
also to ascertain in retrospect how those data could have been used
to track and project system reliability during development.

In reliability growth considerations, it is configuration
changes on the system which are of prime importance. Consequently,
in this study the 801 valid flights were ordered according to
manufacturing date, since this should reflect the sequence and
consequences of system configuration changes during development. The data
consisted of the flight numbers at which a missile failure occurred.
Observe that these are discrete data as opposed to continuous data
in the model. However, it can be shown that for a large number of
data points, the discrete failure process can be approximated by the
continuous model. This approximation improves as the number of data
points increases.


The interpretation of $r(t)$ for this type of application is
that $r(i) = \lambda\beta i^{\beta-1}$ is the probability of failure for the i-th missile
produced, $i = 1, 2, \ldots$. Hence, $R(i) = 1 - r(i)$ is the reliability of the
i-th missile. Analogous to MTBF, $M(i) = [r(i)]^{-1}$ is the mean flight
between failure.
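
In this discrete reading the model functions are evaluated at flight numbers rather than at test times. A minimal sketch; the parameter values below are hypothetical and are not the estimates obtained in the missile study.

```python
def failure_probability(i, lam, beta):
    """r(i) = lam * beta * i**(beta - 1) for the i-th unit produced.
    Only meaningful as a probability where the value does not exceed 1."""
    return lam * beta * i ** (beta - 1.0)

def reliability(i, lam, beta):
    """R(i) = 1 - r(i)."""
    return 1.0 - failure_probability(i, lam, beta)

def mean_flights_between_failures(i, lam, beta):
    """M(i) = 1 / r(i)."""
    return 1.0 / failure_probability(i, lam, beta)

# Hypothetical parameters (illustration only).
lam, beta = 1.0, 0.5
r_100 = failure_probability(100, lam, beta)
R_100 = reliability(100, lam, beta)
M_100 = mean_flights_between_failures(100, lam, beta)
```

For these values the 100-th flight has failure probability 0.05, reliability 0.95, and a mean of 20 flights between failures.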

The first step in determining the reliability growth curve
was to use the failure results for the 801 flights, and equations
(3.1) and (3.2) to estimate the parameters of the Weibull model. The
goodness of fit statistic $C_M^2$, given by equation (3.3), was then
calculated to determine if the model and data were compatible. The
value of the statistic was highly significant (very large) indicating
that the model did not reasonably represent the data. This implies
that a single, smooth, Weibull curve would not reflect the decrease
in failure probability of this system.

Further investigation revealed that the development program
experienced a major re-emphasis on reliability improvement after the
200-th flight. Thus, the parameters of the model were estimated
separately for the first 200 flights (see Figure 1) and for the
remaining 601 flights (see Figure 2). In both cases, the goodness of
fit of the model to the data was acceptable. The horizontal lines in
Figures 1 and 2 are the average failure probabilities over 100 flight
intervals. The smooth curves are the estimated Weibull failure
probabilities $\hat\lambda\hat\beta i^{\hat\beta-1}$. These curves are solid up to the end of the
data, and the dash lines indicate the estimated future decrease in
failure probability if the current rate of improvement were continued.
From the two curves the reliability $R(i) = 1 - r(i)$ is
estimated. The resulting reliability growth curve is shown in
Figure 3 with a jump at 200. The magnitude of the jump was
calculated by parametric and nonparametric means, and consultation with
the program office.

We next considered how the Weibull model could have been
used to track and project system reliability during development.
Using the first 200 flights, the estimate of the current reliability
was .68 and the projected reliability at flight 800 was .74
(Figure 1). This projection indicated that the system reliability
requirement would not be met if the present trend were continued.
There was a major re-emphasis on reliability, and based on the next
100 flights (201-300), an estimate of the reliability at 300 was .89
and a projection to 800 was .94 (Figure 4). This projection was very
close to the current estimate of .95 for system reliability obtained
using all the data on flights 201-800 (Figure 3).

Thus, the estimation procedures provided a good guide as to
when additional emphasis should be placed on reliability, and also
provided accurate estimates of future system reliability for each
phase of the development program.
Figure 1. Estimate of Failure Probability for First 200
Valid Flights.
Figure 2. Estimate of Failure Probability Excluding First
200 Valid Flights.
Table 1. Percentage Points, $u_p$, such that Prob($U_N < u_p$)
Figure 3. Estimated Reliability Based on Weibull Model.
Figure 4. Projected Failure Probability Based on Flights
201 to 300.


MINIMUM VARIANCE SOLUTION OF A POLYNOMIAL FUNCTION
OF TWO NOISY RANDOM VARIABLES


Oren  N.  Dalton 

Mathematical  Services  Branch 
Analysis  and  Computation  Division 
White  Sands  Missile  Range,  New  Mexico 

ABSTRACT 

The multivariate analysis problem involving two random vector
variables, one a dependent and the other an independent variable,
each variable noisy, has not been solved in general. However, if the
two covariance matrices for the vector variables are independent and
known, a maximum likelihood solution is possible in certain
non-Euclidean spaces. This paper discusses an iterative technique for finding
the minimum variance solution to a problem in which the dependent
variable is an m-th degree polynomial function of the independent variable,
and both are normally distributed. The data are assumed to have high
noise content and to have been obtained, manually, from a graph
using a ruler. Because of the nature of the data, problems of stability
may arise. A method for controlling the excursions of the initial
estimates of the polynomial coefficients by means of an ad hoc
Bayesian covariance matrix is included in the derivations, and a way
to convert a divergent problem to a convergent problem by means of
scaling is illustrated. The results for the minimum variance solution
of a third-order polynomial math model, for mutually independent
measurements using data from manual measurements from a graph, are
included.




I.  INTRODUCTION


We have the problem of estimating the parameters for a math model relating
two random variables, each subject to noise. That is, let $\xi_i$ and $\eta_i$ be

    $\xi_i = x_i + \delta_i$
    $\eta_i = y_i + \epsilon_i$

where $\epsilon_i$ and $\delta_i$ represent noise, and such that

    $y_i = f(x_i)$ .

For the purpose of this paper, we assume that $f(x_i)$ is an m-th degree polynomial.
Thus, for any i:

    $y = p_0 + p_1 x + p_2 x^2 + \cdots + p_m x^m$ .


The problem as stated has not been solved in general [1], but if certain
restrictions are assumed a minimum variance solution can be obtained. These
restrictions are the independence criteria, well-known to practitioners of the
art:

    $E\{\epsilon_i\} = E\{\delta_i\} = 0$

    $\mathrm{Cov}\{\epsilon_i, \epsilon_j\} = E\{\epsilon_i \epsilon_j\} = 0, \quad i \ne j$

    $\mathrm{Cov}\{\epsilon_i, \delta_j\} = 0$ ,

and the assumption that the variances are known. (This assumption can be
relaxed [2], in that the variances can be estimated. It is assumed here that
such estimates, if necessary, have been made.)

For the polynomial in x, we note that if m = 1, a fairly straightforward
derivation produces a quadratic equation for $p_1$; for example, see [1, pp. 258-60].
There are, however, several complications which may arise even in this simple
case, which are enumerated in considerable detail by Worthing and Geffner
[3, pp. 375-91]. Additional difficulties arise when m > 1, and a short discussion
of some of these problems is also given by Worthing and Geffner [3, pp.
608-13], based mainly on the work of Geary [6].

On the other hand, it was shown [5] that most distributions commonly used
can be classified in less than half-a-dozen equivalence classes described by
non-Euclidean spaces, to the limit of a parameter; anything true for one
member of the class is true for other members, or is true for elements in spaces
derivable from such spaces. Thus, the non-Euclidean space described as the
"•-log" space includes such distributions as the Chi-squared, Maxwell, Gamma,
Rayleigh and Normal; the parimenaic spaces derivable from this include the
Beta, Student-t and Fisher (F); the derivable uniform space includes the uniform,
Exponential or Poisson, Logistic and Cauchy.

The exposition in this paper is concerned with Normal variates, but from the
above statements, it can be shown by extension [5] and [6], that the results
are applicable to any of the other distributions. (Order statistics, since
the range of the distribution depends on the variates, are, perforce, ruled
out.)


II.  NOMENCLATURE AND DEFINITIONS.

1.  A vector will be represented as:

a.  Tt 

b.  The Dirac bra " < " and ket " > " will only indicate row, column
vectors resp. Functions involving these symbols have no other implication
than standard matrix operations. Thus, for A, a matrix, <xAy> is a bilinear
form for vectors x and y.

2.  f(x)  represents  a functional  of  the  variables  in  the  vector,  x. 

3.  Matrices or vectors may be defined as arrays whose elements are,
themselves, arrays. Thus:

    $A = [\, A_1>\ \ A_2>\ \ \cdots\ \ A_m> \,]$

represents an m x m array, each of whose elements is a vector. If each $A_i>$ has
$k_i$ elements and if $\sum_{i=1}^{m} k_i = \nu$, then the total size of A in terms of
scalar quantities is, perforce, $m \times \nu$.

4.  | A | represents the absolute value of the determinant of a matrix, A.

5.  $\phi>$, $\Phi$ represent a vector or matrix of zeros, resp. Subscripts,
if applicable, will indicate dimension.

6.  I or $I_n$ represent unit matrices; the dimension is specified in the
latter case.

7.  Some operators

a.  $\Gamma(x)$ indicates the diagonal matrix whose diagonal elements are
the components of x. Thus, if

    $x> = [\, x_1\ \ x_2\ \ \cdots\ \ x_n \,]^T$

then

    $\Gamma(x) = \mathrm{diag}(x_1, x_2, \ldots, x_n)$ .

b.  $<D_k(x)$ is the vector for the k-th derivative of the polynomial
in x. (Note: $<D_k(x_o)$ implies the k-th derivative vector of x evaluated at
$x_o$.) Thus:

    1)  $<D_0(x) = [\, 1\ \ x\ \ x^2\ \ \cdots\ \ x^m \,]$

    2)  $<D_1(x) = [\, 0\ \ 1\ \ 2x\ \ \cdots\ \ m x^{m-1} \,]$

    Etc.

c.  $D_k(x)$ is the matrix, each of whose rows is the k-th derivative
of the corresponding component of x. Thus:

    $D_k(x) = \begin{bmatrix} <D_k(x_1) \\ <D_k(x_2) \\ \vdots \\ <D_k(x_n) \end{bmatrix}$

($D_k(x_o)$ is defined similarly to that in "b", above.)
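
The operators of item 7 are easy to realize numerically. The following is a minimal sketch in plain Python (lists of lists standing in for matrices); the function names are our own and the numbers are illustrative only.

```python
def gamma(v):
    """Diagonal matrix whose diagonal elements are the components of v."""
    n = len(v)
    return [[v[i] if i == j else 0.0 for j in range(n)] for i in range(n)]

def d_row(k, x, m):
    """Row vector <D_k(x): the k-th derivative with respect to x of
    [1, x, x**2, ..., x**m]."""
    row = []
    for j in range(m + 1):
        if j < k:
            row.append(0.0)            # the derivative of x**j vanishes
        else:
            coeff = 1.0
            for r in range(k):         # j * (j - 1) * ... * (j - k + 1)
                coeff *= (j - r)
            row.append(coeff * x ** (j - k))
    return row

def d_matrix(k, xs, m):
    """Matrix D_k(x) whose i-th row is <D_k(x_i)."""
    return [d_row(k, x, m) for x in xs]

def matvec(a, v):
    """Matrix times column vector."""
    return [sum(a_ij * v_j for a_ij, v_j in zip(row, v)) for row in a]

# Illustrative values: a cubic evaluated at x = 2.
row0 = d_row(0, 2.0, 3)   # [1, 2, 4, 8]
row1 = d_row(1, 2.0, 3)   # [0, 1, 4, 12]
row2 = d_row(2, 2.0, 3)   # [0, 0, 2, 12]

# D_0(x) p> evaluates the polynomial 1 + 2x at each component of x.
y = matvec(d_matrix(0, [0.0, 1.0, 2.0], 1), [1.0, 2.0])

# The diagonalizing operator satisfies Gamma[v] w> = Gamma[w] v>.
v, w = [1.0, 2.0, 3.0], [4.0, 5.0, 6.0]
left, right = matvec(gamma(v), w), matvec(gamma(w), v)
```

The last two lines check the interchange property of the diagonalizing operator, which holds because both sides are the componentwise product of v and w.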


III.  THE PROBLEM DEFINITION.

We assume that we have two measurement vectors $\eta$, $\xi$, each with
n components, measured independently, where

    $\eta> = y> + \epsilon>$

    $\xi> = x> + \delta>$

    $\mathrm{Cov}\{\epsilon><\epsilon\} = E\{\epsilon><\epsilon\} = Q_y$

    $\mathrm{Cov}\{\delta><\delta\} = E\{\delta><\delta\} = Q_x$

and both $\eta$ and $\xi$ have a Normal distribution. Further:

    $y_i = p_0 + p_1 x_i + p_2 x_i^2 + \cdots + p_m x_i^m$ .

Thus we can write:

    $y> = D_0(x)\, p>$

where

    $p> = [\, p_0\ \ p_1\ \ \cdots\ \ p_m \,]^T$ .
i.  Pn 
We also assume that we have a prior distribution which describes our faith
in the initial estimates of the parameters in p, also normally distributed.
This ad hoc "Bayesian" adjunct is used to control the excursions on the
adjustment of the parameters. The danger in its use lies in the fact that
for any "small" variance on a parameter, we must be sure that particular
parameter is well-known. On the other hand [7], this "control function"
can force convergence in an otherwise divergent problem.

We can then write the likelihood function as:

    $L = K \exp\{ -\tfrac{1}{2} <\epsilon Q_y^{-1} \epsilon> - \tfrac{1}{2} <\delta Q_x^{-1} \delta> - \tfrac{1}{2} <\Delta p_o Q_p^{-1} \Delta p_o> \}$

where

    $K = (2\pi)^{-(2n+m+1)/2}\, |Q_y^{-1}|^{1/2}\, |Q_x^{-1}|^{1/2}\, |Q_p^{-1}|^{1/2}$

    $\Delta p_o = p - p_o'$ ,

where $p_o'$ will represent the initial estimates of the parameters, and
$Q_p$ will be chosen as a diagonal matrix whose diagonal components are the
variances assumed for the parameters in p. It has been found that one
parameter is more likely to be known than any of the others. In this problem
the bias (i.e., $p_0$) is likely to be best known, so $\mathrm{Var}(p_0)$ will be small.
The variances of the other terms in p will be set to $10^{12}$. The effect of
this procedure (though not on a polynomial) was documented in [7].

Thus, our minimization function, I, can be written [1] as:

    $I = <\epsilon Q_y^{-1} \epsilon> + <\delta Q_x^{-1} \delta> + <\Delta p_o Q_p^{-1} \Delta p_o>$ .
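
To make the three parts of the minimization function concrete, the sketch below evaluates the sum of the quadratic forms in the residuals of the dependent variable, the residuals of the independent variable, and the departure of the parameters from their initial estimates. It assumes all three covariance matrices are diagonal, so that each is passed as a list of variances; the function names and data are hypothetical.

```python
def poly(p, x):
    """p[0] + p[1]*x + p[2]*x**2 + ..."""
    return sum(c * x ** j for j, c in enumerate(p))

def objective(p, x, eta, xi, var_y, var_x, p_prior, var_p):
    """Minimization function for diagonal covariances:
    sum eps_i**2 / var_y_i + sum delta_i**2 / var_x_i
    + sum (p_j - p_prior_j)**2 / var_p_j,
    with eps_i = eta_i - f(x_i) and delta_i = xi_i - x_i."""
    total = 0.0
    for x_i, eta_i, xi_i, vy, vx in zip(x, eta, xi, var_y, var_x):
        eps = eta_i - poly(p, x_i)     # residual in the dependent variable
        delta = xi_i - x_i             # residual in the independent variable
        total += eps ** 2 / vy + delta ** 2 / vx
    for p_j, p0_j, vp in zip(p, p_prior, var_p):
        total += (p_j - p0_j) ** 2 / vp   # ad hoc prior ("control") term
    return total

# Hypothetical two-point example for the line y = 1 + 2x.
p = [1.0, 2.0]
x = [0.0, 1.0]                         # current estimates of the true abscissas
eta = [1.0, 3.5]                       # measured ordinates
xi = [0.0, 1.5]                        # measured abscissas
value = objective(p, x, eta, xi, [1.0, 1.0], [1.0, 1.0], p, [1e12, 1e12])
```

Here only the second point contributes, with residuals of 0.5 in each variable, so the function evaluates to 0.25 + 0.25 = 0.5. A very large prior variance such as $10^{12}$ makes the corresponding prior term negligible, while a small one holds that parameter near its initial estimate.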


IV.  DERIVATION OF THE MINIMIZATION FUNCTIONS.

The approach toward minimization will be by the method of steepest descent
using an iterative procedure. That is, we will add adjustments to the variables
until the magnitude of the adjustments approaches zero. A moment's reflection
[9] indicates that we will adjust the independent variable, x, and the
parameter vector, p, until the residuals $\epsilon$ and $\delta$ minimize I. Since we are
assuming analyticity in a neighborhood of $\eta$, $\xi$ and $p_o$, we can expand
x and p in a Taylor's expansion, and since the method of steepest descent
is a first-order process we will write:

    $x> = x_o> + \Delta x>$

    $p> = p_o> + \Delta p>$

where it is understood that the "o" subscript (except on $p_o'$) refers to the
value of the variable during any iteration.

For the remainder of this section we will assume that n measurements have
been made of the variables $\xi$ and $\eta$ so that $\eta$, $\xi$, y, x and $\Delta x$
are n-component vectors. The definition of the order of the polynomial will
be changed to m-1 so that p, $p_o$, $p_o'$, $\Delta p$ are m-component vectors.

A.  Characterization of $\epsilon$

We expand $\epsilon$ in a Taylor's series retaining the first two terms. Thus:

    $\epsilon> = \epsilon_o> + (\frac{\partial}{\partial x}><\epsilon)_o \Delta x> + (\frac{\partial}{\partial p}><\epsilon)_o \Delta p>$ .

1.  Derivation of $(\frac{\partial}{\partial x}><\epsilon)$

Let $J_y = D_0(x)$. Then

    $\epsilon> = \eta> - J_y\, p>$

from which, using the method developed by Dalton in [8, Appendix C] for forming
the partial derivative of arrays with respect to arrays, we can write:

    $(\frac{\partial}{\partial x}><\epsilon) = \frac{\partial}{\partial x}>(<\eta - <p J_y^T) = -\Gamma[D_1(x)p>]$ .


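Now, the i-th term in $\Gamma[D_1(x)p>]$ is

    $<D_1(x_i)p>$

and since $p> = p_o> + \Delta p>$, we have:

    $<D_1(x_i)p> = <D_1(x_i)p_o> + <D_1(x_i)\Delta p>$ .

The term $<D_1(x_i)$ can be written as:

    $<D_1(x_i) = [\, 0\ \ 1\ \ 2x_i\ \ 3x_i^2\ \ \cdots\ \ (m-1)x_i^{m-2} \,]$

    $= [\, 0\ \ 1\ \ 2(x_{io}+\Delta x_i)\ \ 3(x_{io}+\Delta x_i)^2\ \ \cdots\ \ (m-1)(x_{io}+\Delta x_i)^{m-2} \,]$

    $\approx [\, 0\ \ 1\ \ 2x_{io}+2\Delta x_i\ \ \cdots\ \ (m-1)x_{io}^{m-2} + (m-1)(m-2)x_{io}^{m-3}\Delta x_i \,]$

    $= <D_1(x_{io}) + \Delta x_i <D_2(x_{io})$ .

From which

    $<D_1(x_i)p> = <D_1(x_{io})p_o> + <D_2(x_{io})p_o>\Delta x_i + <D_1(x_{io})\Delta p> + <D_2(x_{io})\Delta p>\Delta x_i$ .

If we define

    $J_{xp} \triangleq \Gamma[D_1(x)p>]$

then under the aegis of the assumption of analyticity in the neighborhood of $x_o$
we can drop second-order terms and write:

    $\Gamma[D_1(x)p>] \approx \Gamma[D_1(x_o)p_o>] + \Gamma[D_1(x_o)\Delta p>] + \Gamma[D_2(x_o)p_o>]\Gamma[\Delta x]$

    $\triangleq J_{xpo} + J_{xp1} + J_{xp2}$

respectively. Thus:

    $(\frac{\partial}{\partial x}><\epsilon) = -J_{xp} \approx -J_{xpo} - J_{xp1} - J_{xp2}$ .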


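2.  Derivation of $(\frac{\partial}{\partial p}><\epsilon)$

    $(\frac{\partial}{\partial p}><\epsilon) = \frac{\partial}{\partial p}>(<\eta - <p J_y^T) = -J_y^T$ .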

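B.  Characterization of $\delta$

In a manner similar to that above, we have the following:

    $\delta> = \delta_o> + (\frac{\partial}{\partial x}><\delta)_o \Delta x> + (\frac{\partial}{\partial p}><\delta)_o \Delta p>$

    $(\frac{\partial}{\partial x}><\delta) = \frac{\partial}{\partial x}>(<\xi - <x) = -I_n$

    $(\frac{\partial}{\partial p}><\delta) = \Phi$ .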

C.  Characterization of $J_y$

Since $J_y = D_0(x)$, expanding this similarly to the one done above for
$(\frac{\partial}{\partial x}> J_y)$ we have that

    $J_y = D_0(x) \approx D_0(x_o) + \Gamma[\Delta x] D_1(x_o)$

    $\triangleq J_{yo} + \Delta J_y$ , resp.


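D.  Reformulation of the Minimization Functional, I

We can write $\epsilon$ and $\delta$ as:

    $\epsilon> = \epsilon_o> - \Gamma[D_1(x)p>]\Delta x> - J_y \Delta p>$

    $\quad = \epsilon_o> - J_{xp}\Delta x> - J_y \Delta p>$

    $\delta> = \delta_o> - \Delta x>$ .

Thus:

    $I = <\epsilon Q_y^{-1} \epsilon> + <\delta Q_x^{-1} \delta> + <\Delta p_o Q_p^{-1} \Delta p_o>$

    $= (<\epsilon_o - <\Delta x J_{xp} - <\Delta p J_y^T) Q_y^{-1} \epsilon> + (<\delta_o - <\Delta x) Q_x^{-1} \delta> + (<p_1 + <\Delta p) Q_p^{-1} \Delta p_o>$

    $= <\epsilon_o Q_y^{-1} \epsilon_o> - 2<\epsilon_o Q_y^{-1} J_{xp} \Delta x> - 2<\epsilon_o Q_y^{-1} J_y \Delta p>$

    $\quad + <\Delta x J_{xp} Q_y^{-1} J_{xp} \Delta x> + 2<\Delta x J_{xp} Q_y^{-1} J_y \Delta p>$

    $\quad + <\Delta p J_y^T Q_y^{-1} J_y \Delta p> + <\delta_o Q_x^{-1} \delta_o>$

    $\quad - 2<\delta_o Q_x^{-1} \Delta x> + <\Delta x Q_x^{-1} \Delta x>$

    $\quad + <p_1 Q_p^{-1} p_1> + 2<p_1 Q_p^{-1} \Delta p> + <\Delta p Q_p^{-1} \Delta p>$ .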

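Since we wish to keep terms of no higher than second-order we have:

1.  $<\epsilon_o Q_y^{-1} J_{xp} \Delta x> = <\epsilon_o Q_y^{-1} J_{xpo} \Delta x> + <\epsilon_o Q_y^{-1} J_{xp1} \Delta x> + <\epsilon_o Q_y^{-1} J_{xp2} \Delta x>$

2.  $<\epsilon_o Q_y^{-1} J_y \Delta p> = <\epsilon_o Q_y^{-1} J_{yo} \Delta p> + <\epsilon_o Q_y^{-1} \Delta J_y \Delta p>$

3.  $<\Delta x J_{xp} Q_y^{-1} J_{xp} \Delta x> \approx <\Delta x J_{xpo} Q_y^{-1} J_{xpo} \Delta x>$

4.  $<\Delta x J_{xp} Q_y^{-1} J_y \Delta p> \approx <\Delta x J_{xpo} Q_y^{-1} J_{yo} \Delta p>$

5.  $<\Delta p J_y^T Q_y^{-1} J_y \Delta p> \approx <\Delta p J_{yo}^T Q_y^{-1} J_{yo} \Delta p>$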
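Make the following definitions:

    $K \triangleq <\epsilon_o Q_y^{-1} \epsilon_o> + <\delta_o Q_x^{-1} \delta_o> + <p_1 Q_p^{-1} p_1>$

    $\epsilon_A> \triangleq Q_y^{-1} \epsilon_o>$

    $\delta_A> \triangleq Q_x^{-1} \delta_o>$ .

Then:

    $I = K - 2<\epsilon_A J_{xpo} \Delta x> - 2<\epsilon_A J_{xp1} \Delta x> - 2<\epsilon_A J_{xp2} \Delta x>$

    $\quad - 2<\epsilon_A J_{yo} \Delta p> - 2<\epsilon_A \Delta J_y \Delta p> + <\Delta x J_{xpo} Q_y^{-1} J_{xpo} \Delta x>$

    $\quad + 2<\Delta x J_{xpo} Q_y^{-1} J_{yo} \Delta p> + <\Delta p J_{yo}^T Q_y^{-1} J_{yo} \Delta p> - 2<\delta_A \Delta x>$

    $\quad + <\Delta x Q_x^{-1} \Delta x> + 2<p_1 Q_p^{-1} \Delta p> + <\Delta p Q_p^{-1} \Delta p>$ .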


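E.  Minimization of I

I will be a minimum when

    $\frac{\partial I}{\partial \Delta x}> = \phi>$ and $\frac{\partial I}{\partial \Delta p}> = \phi>$ .

1.  Derivation of $\frac{\partial I}{\partial \Delta x}> = \phi>$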


31 
3 AX 


-J  e,>  - J _e> 

xpo  A xp2  A 


Cl*  3 Ax  >Jxp2}c2{Ax}eA>  - >AJy)cg{ ApkA> 


+ J Q”1J  Ax>  + J a_1J  Ap>  - 26, > + 

xpo  y xpo  xpo  y yo  * A yc 


We note that the diagonalizing function $\Gamma[\cdot]$ has the following
characteristic:

    $\Gamma[v]w> = \Gamma[w]v>$ , $\forall\ v, w$ .


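Thus we can write:

a.  $J_{xpo}\,\epsilon_A> = \Gamma[D_1(x_o)p_o>]\,\epsilon_A> = \Gamma[\epsilon_A]\,D_1(x_o)p_o>$

b.  $J_{xp1}\,\epsilon_A> = \Gamma[D_1(x_o)\Delta p>]\,\epsilon_A> = \Gamma[\epsilon_A]\,D_1(x_o)\Delta p>$

c.  $J_{xp2}\,\epsilon_A> = \Gamma[D_2(x_o)p_o>]\,\Gamma[\Delta x]\,\epsilon_A> = \Gamma[D_2(x_o)p_o>]\,\Gamma[\epsilon_A]\,\Delta x>$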
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"•  {1<  537  >,xp2,t2(iX,tA> 


“ <!<  — >r!  iI)Ha(r(D2(;o)po>)Ax>(v 


I u 

Define $e_i>$ as the orthonormal vector. That is, a vector with all
zeros except the i-th component which is a one. Then if:

Et  4 V"l 


*•  «1*  337  >Jxp2H2{4xlV 


[Eir[D2(xo)po>]Ax>  * • * t'nr[D2(xo)po>]Ax>]eA> 


Eir^D2(xo)po>^X>  " 


♦> 


AVVxio>V 


* ith  row 


Sl>  337  >Jxp2,t2U,)eA> 


<D2<Xlo,Po>4Xl 


<D2(x2o,Po>4X2  * 


<03(x„)prt>Ax 
2 no  o n 


= r(D2(xo)]r[Ax]rA> 

■ r[cA]r[D2(xo)po>]Ax> 




6-  'il  ‘m  >4y's“p>,v 

■ «!<  !«><r,s;iDi(Io1,kaUplcA> 

» [E1D1(x0)Ap>  • * * EnD1(xo)Ap>]eA> 

EiDl(xo*Ap>  * 


<Di(Xi0)Ap> 

<Dl(x2o)Ap> 

♦ 

<Dl(xno)AP 

- TlD^x^ApjJe^ 

" r^‘A^Dl^*o^p>  * 

If we make the following definitions:

"i*  * ftWVv  * v 

■ r'Di<Jo>V1V  * v 
S1  4 -Srt*AI  * r[V*o)V1^\V 
®2  4 -!['t'Alr[D2lJo)V1  * ^ 

* r[o1(i0)p0>]^1r[D1(J0)i.0>] 

Then 

ffe>  " ♦ "*  Vp>  + 82&x>  - V 

■ ♦> 


'l(i5  )52(Ap).A» 


*> 

°l(xio)Ap> 

#> 




2.  Derivation  of  --  > * *> 

A A i\  T * 


ai 

SAp 


> » ^ 


“ 3Ap  >(<Vxpl4x>)  ~ jJeeA> 
-2Aj'fCi>  + jT  a"1, 


♦ J 


y”A  ' JyoSr  JxpoAx> 
T 


Jyo%  JyoAp>  + * Q~V  . 

Observe the following terms:

“•  5aF'('Vxp13 4»’)  * |— ■ >(<cArt,'1<»„)4p>lAx>) 


* 9Ap  >(<VlA*]D1(*0)ap’) 


D[(20)r[A£hA>  - D'{'(xo)r[ 


b.  AJ 


,V  ■ J»J(*0)l't4*)cA> 


Making the following definitions:

“s'  4 DX>V  * <$V 

S1  4 »X*fViV,>  * <£ 


Then 


i)I 

3Ap 


a 


*^p>  + gJax> 


- I? 


3‘  Vor  Ad>  and  A«>. 

We have two equations in two unknowns, viz.:


V.  INITIALIZATION. 

The "best" initial estimate for p is obtained (unless additional
information is available) from the normal equations. Thus


-V  ■-*■?**>*£ 


VI. AN ADDITIONAL METHOD FOR CONTROLLING DIVERGENCE FOR THE CASE: $Q_x = Q_y = I$


The algorithm derived in the previous pages was based on a linear approximation
for the error in an assumed analytic neighborhood of the true solution. Inasmuch as
both the dependent and independent variables are adjusted to achieve a minimum, the
algorithm is usually sensitive to noise and the large excursions of the dependent
variable. Indeed, in the form shown there is definitely a tendency for the algorithm
to be unstable and to diverge with unbecoming frequency.

A large number of empirical studies were made for various kinds of polynomials
under various conditions of noise content (in which the basic premise for which the
study was intended, namely $Q_x = Q_y = I$, was assumed) and comparisons were made
among those which diverged and those which converged. The criteria which distinguished
the convergent polynomials from the divergent ones surprised this author. At this
writing, the author has not studied the theoretical aspects and cannot supply the
reasons. (One might suspect that an investigation similar to that which
demonstrates the reasons for the instabilities of Milne's integration method might
help toward an understanding of this problem.)

Early, it was found that if the input were randomized, formerly divergent problems
would converge, though not always.¹ Figures 1 to 4 show a study in which the algorithm
diverged after randomization. (It diverged before randomization, but this result is
not shown.)


1. All work was computed in double precision.

A number of additional studies involving polynomials of various shapes disclosed
the rather surprising fact that, apparently, the only criterion needed to ensure
convergence was that the (scaled) slope of a line based on the total height of the first
and last points of the variables must be less than 45°,¹ if the terms in the
independent variable were monotonically nondecreasing.

A scaling criterion was introduced which compared $|\xi_n - \xi_1|$ with $|\eta_n - \eta_1|$.
If $|\xi_n - \xi_1| < |\eta_n - \eta_1|$, the data in $\eta$ was replaced by $s\eta$, where
$s = 0.9\,|\xi_n - \xi_1| / |\eta_n - \eta_1|$. Figures 5 to 10
demonstrate the effect on identical data of the divergent problem (Figures 1 to 4)
subject to this scaling criterion. In this case, the algorithm converged.
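
The rescaling step can be sketched as follows. This is a minimal
illustration only: it assumes the criterion compares the end-to-end
range of the independent variable with that of the dependent variable,
and the names rescale_dependent, xi, eta, and factor are hypothetical,
not taken from the original program. Only the factor 0.9 and the 45°
slope condition come from the text.

```python
def rescale_dependent(xi, eta, factor=0.9):
    """Shrink eta so the first-to-last slope is below 45 degrees.

    xi and eta are equal-length lists of observations. Returns the
    (possibly rescaled) eta values and the scale factor s applied.
    """
    run = abs(xi[-1] - xi[0])      # end-to-end range of the independent variable
    rise = abs(eta[-1] - eta[0])   # end-to-end range of the dependent variable
    if rise == 0 or run >= rise:
        # Slope is already at most 45 degrees: leave the data alone.
        return list(eta), 1.0
    s = factor * run / rise        # makes the end-to-end slope equal to factor
    return [s * value for value in eta], s


# Hypothetical data with an end-to-end slope of 5 (well above 45 degrees).
scaled_eta, s = rescale_dependent([0.0, 1.0, 2.0], [0.0, 5.0, 10.0])
```

After fitting in the scaled units, the estimated polynomial coefficients
would have to be divided by s to return to the original units of the
dependent variable.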

The data for the above studies were generated from the polynomial:


$y = 22.5 + 2.125\,x - 0.5\,x^2 + 0.03125\,x^3$


to which zero-mean Gaussian noise with a variance of 0.02 had been added to each of the
2n observations in $\xi$ and $\eta$, respectively. Further studies were done in which the variance of
the noise was increased to 4.0, which is the same size as the first point of the
independent variable, and the domain of the independent variable was decreased so that it
ranged through the values: 4.0, 4.2, ..., 13.8. The algorithm still converged when the
independent variable was rescaled, but not surprisingly, the number of iterations
increased substantially. These results are not shown.
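
As an illustration of how such test data might be produced, the sketch
below evaluates the cubic on the grid 4.0, 4.2, ..., 13.8 and adds
zero-mean Gaussian noise to both variables. The function names, the
fixed seed, and the use of a step of 0.2 (inferred from "4.0, 4.2, ...")
are assumptions; the original generator is not described in the text.

```python
import random


def cubic(x):
    """The generating polynomial quoted in the text."""
    return 22.5 + 2.125 * x - 0.5 * x**2 + 0.03125 * x**3


def noisy_observations(variance, seed=0):
    """Return (grid, xi, eta): true abscissas and noisy observations."""
    rng = random.Random(seed)          # fixed seed so the sketch is repeatable
    sd = variance ** 0.5               # rng.gauss takes a standard deviation
    grid = [4.0 + 0.2 * k for k in range(50)]          # 4.0, 4.2, ..., 13.8
    xi = [x + rng.gauss(0.0, sd) for x in grid]        # noisy independent variable
    eta = [cubic(x) + rng.gauss(0.0, sd) for x in grid]  # noisy dependent variable
    return grid, xi, eta


grid, xi, eta = noisy_observations(variance=0.02)
```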


1. It might be of interest that the first attempt at rescaling was to make $\xi$
and $\eta$ each a unit vector. This didn't affect the divergence or convergence
tendency at all.


Figures 11 to 15 show the convergence of the algorithm based on the polynomial:


$y = 12.0 + 7.0\,x - 0.2\,x^2 - 0.1\,x^3$

in which the noise variance was again 0.02. This curve is dome-shaped and so constructed
that the first and last points are sufficiently close together that the data was barely rescaled.

This demonstrates the, apparently, sufficient criterion that the slope based on the
first and last point determines whether the algorithm will converge or not.

Finally, Figures 16 onward show the convergence of the algorithm from data obtained
manually using a ruler and a strip chart. Furthermore, the curve from which the
measurements were obtained jittered over a width of about 3/8 of an inch. Such data
is crude by any standards for computer work, but, hearteningly, the algorithm
converged nicely. Several other studies using data obtained in a similar manner from
similar sources always had a convergent algorithm when the rescaling criterion was
employed.

VII. RESULTS AND CONCLUSIONS.

This paper has discussed the problems of a minimum variance iterative first-order
solution for two noisy vector variables, a dependent and an independent one, in which
the dependent variable is related to the independent variable by a polynomial. A
number of studies were made using a variety of polynomial shapes, of degree three.

It was assumed that the noise on each independent observation was Gaussian with
a zero mean. The variance of the noise was varied from 0.02 to 4.0, the latter of
the same order of magnitude as the independent variable.

Two criteria for assisting algorithm convergence have been introduced:

(i) An ad hoc Bayesian distribution which controlled the excursions of the
polynomial coefficients,

(ii) Rescaling the dependent variable so that the slope based on the first and
last data point was less than 45°.

Of these criteria no studies have been presented using criterion (i), but
the effectiveness of this method has been discussed in [7].

Rather complete studies based on criterion (ii) were made using a 3rd order
polynomial in which $Q_x = Q_y = I$, that is, a least squares solution. For all cases
tried, the algorithm converged every time. At present, because of the original goals
of the problem, the use of (ii) as a convergence criterion is deemed adequate for the
requirements of the project.

THE PROBABILITY OF MOTOR CASE RUPTURE


RONALD S. DOWNS
US ARMY WHITE SANDS MISSILE RANGE
WHITE SANDS MISSILE RANGE, NEW MEXICO


ABSTRACT 


Statistical  Procedures  are  studied  for  the  evaluation  of 
the  probability  that  a motor  case  may  rupture  as  a result 
of  excessive  pressures  exerted  by  the  propellant.  Normally, 
this  study  is  based  upon  two  sets  of  data.  The  first  consists 
of  data  indicating  the  pressure  required  to  burst  a motor  case 
(X),  and  the  second  consists  of  data  indicating  the  maximum 
pressure  exerted  by  the  motor  (Y).  These  data  are  obtained 
from  two  separate  tests;  the  sample  sizes  for  each  test  are 
usually  different;  and  while  X is  usually  tested  under  a fixed 
set  of  conditions,  Y is  frequently  tested  under  a variety  of 
environmental  conditions  and  therefore  makes  use  of  a designed 
experiment.  This  may  be  recognized  as  a special  case  of  the 
problem  of  estimating  component  reliability  from  sample 
measurements  taken  of  the  stresses  applied  and  the  strength 
of the component.

Four  techniques  were  studied  for  the  evaluation  of  the 
probability  of  motor  case  rupture.  All  required  independence 
for  X and  Y;  the  first  requires  normality  for  (X-Y),  the 
second  requires  normality  for  X and  Y,  and  the  third  and 
fourth  require  few  assumptions  concerning  the  distribution 
of  X or  Y.  The  procedures  are: 

1.  One-Sided  Statistical  Tolerance  Limits. 

2.  The  Church-Harris-Downton  (CHD)  Procedure. 


3.  Birnbaum  - McCarty  Procedures. 

4.  The  Chebycheff  Inequality. 

The last two methods either provide unacceptable results or
require an unacceptably large sample size. Either of the
first two methods can provide acceptable results with a
reasonable sample size if the assumptions of normality can
be considered valid. Of these first two, the CHD method
appears to give the narrower confidence limits, but the
tolerance limit method may be preferable for small samples.

Actual test data was used to test the assumptions underlying
the various methods. From this data, it was concluded that
X may be distributed almost as the normal, but there is some
evidence that Y and (X-Y) may deviate from normality. The
implications of these deviations are discussed.


THE  PROBABILITY  OF  MOTOR  CASE  RUPTURE 


1.  INTRODUCTION. 

a. An important problem when evaluating the safety of a
missile or rocket system is the determination, at a suitable
level of confidence, that the probability of the motor case
rupturing is less than some pre-determined small value.
Normally, an estimate of the probability of case rupture will
be obtained from two tests, each limited to relatively small
samples. The first test will be to determine the maximum
pressure (Y) exerted by the motor. The second test will be
to determine the pressure (X) required to burst the motor case.
The maximum pressure tests will frequently be conducted under
a variety of environmental conditions and will, therefore,
make use of a designed experiment, while the motor case tests
will nearly always be conducted under a fixed set of
environmental conditions. The first point is mentioned because, for
most maximum pressure tests, the degrees of freedom cannot be
expected to be one less than the sample size.

b.  The  case  rupture  problem  is  a special  case  of  the 
well-known  problem  in  which  the  reliability  of  a component 

is  estimated  by  determining  the  probability  that  the  strength 
(X)  of  the  component  exceeds  the  stresses  (Y)  which  are  exerted 
on  the  component.  The  main  difference  lies  in  the  fact  that 
the  examples  and  supporting  data  of  this  report  will  be  related 
entirely  to  the  motor  case  problem. 

c. The following methods for solving this problem are
discussed:

(1) The Statistical Tolerance Limits.

(2) Church-Harris-Downton (CHD) Procedure.

(3) Birnbaum - McCarty Procedure.

(4) The Chebycheff Inequality.

The advantages and disadvantages of each procedure are discussed
and actual test data is used to evaluate the assumptions of
these procedures.

2. STATISTICAL TOLERANCE LIMITS


a. A procedure which has been used for some time for the
solution of case rupture problems has been that of statistical
tolerance limits. If it can be shown, at the $\gamma$% level of
confidence, that (X-Y) < 0 no more than $\epsilon$% of the time, then it
is clear that at the $\gamma$% level of confidence, the probability
of case rupture does not exceed $\epsilon$%. Making use of a table of
one-sided normal tolerance limits, such as Reference a, this
can be determined if:

(1) X-Y is normally distributed

(2) $\bar{X} - \bar{Y}$ is known

(3) $n_{x-y}$ can be determined

(4) $S_{x-y}$ can be determined

(5) Degrees of Freedom: $f_{x-y}$ can be determined

b. If the assumption of normality is valid, the required
information is available from the following formulae:

(1) $\overline{X-Y} = \bar{X} - \bar{Y}$    F(1)

(2) $S_{x-y}^2 = S_x^2 + S_y^2$    F(2)

(3) $n_{x-y} = \dfrac{S_x^2 + S_y^2}{S_x^2/n_x + S_y^2/n_y}$    F(3)

(Note Appendix 2 for discussion)

(4) $f_{x-y} = \dfrac{(S_x^2 + S_y^2)^2}{S_x^4/(f_x + 2) + S_y^4/(f_y + 2)} - 2$    F(4)

(Note Reference b)

c. Referring to Example A, Appendix 1, the following
can be computed:

(1) $\bar{X} - \bar{Y} = 2500$

(2) $S_{x-y} = \sqrt{(250)^2 + (450)^2} = 514.78$

(3) $(\bar{X} - \bar{Y})/S_{x-y} = 4.856$

(4) $n_{x-y} = 9.22$

(5) $f_{x-y} = 14.31$
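
The Example A figures can be checked with a short script. This is a
sketch under stated assumptions: the function name tolerance_inputs is
hypothetical, and the form used for F(4), with f + 2 in the denominators
and 2 subtracted from the quotient, is inferred from the worked values
(it yields 14.31 for Example A) rather than read directly from the
formula as printed.

```python
import math


def tolerance_inputs(xbar, ybar, sx, sy, nx, ny, fx, fy):
    """Quantities needed to enter a one-sided tolerance limit table."""
    diff = xbar - ybar                            # F(1): mean of X - Y
    var = sx**2 + sy**2                           # F(2): variance of X - Y
    s = math.sqrt(var)
    n_eff = var / (sx**2 / nx + sy**2 / ny)       # F(3): effective sample size
    # F(4), assumed form: effective degrees of freedom
    f_eff = var**2 / (sx**4 / (fx + 2) + sy**4 / (fy + 2)) - 2
    return diff, s, diff / s, n_eff, f_eff


# Example A of Appendix 1: X (burst) then Y (maximum pressure)
diff, s, ratio, n_eff, f_eff = tolerance_inputs(
    xbar=6000, ybar=3500, sx=450, sy=250, nx=9, ny=10, fx=8, fy=17
)
```

The tolerance factor itself still has to be looked up (and interpolated)
in a table such as Reference a; the script only prepares the table
arguments.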

d. Referring to pages 180 and 182 of Reference a, and
performing several interpolations, it can be determined that
at the 90% level of confidence, the probability of a case
rupture does not exceed $3.30 \times 10^{-4}$.

e.  Formulas  F(3)  and  F(4)  frequently  provide  fractional 
answers.  One  may  proceed  by  rounding  the  fractions  to  the 
nearest  integers  and  computing  the  desired  probability,  or 
the  fractions  can  be  retained  and  the  solution  can  involve 
extensive  interpolation. 

f. Advantages.

(1)  At  a given  level  of  confidence,  low  probabilities 

of  case  rupture  can  be  obtained  with  relatively  small  samples. 

(2)  If  suitable  tolerance  limits  tables  are  available, 
the  procedure  is  relatively  simple  to  apply  (especially  simple 
if  f and  n are  integers). 

g.  Disadvantages. 

(1)  The  procedure  is  sensitive  to  deviations  from 
normality. 

(2) As will be shown in Section 3, the Church-Harris-Downton
procedure generally provides lower probabilities of
case rupture than the tolerance limit procedure.

3. THE CHURCH-HARRIS-DOWNTON (CHD) PROCEDURE.

a. The Church-Harris-Downton (CHD) Procedure evolved
through three journal articles, References c through e, and
was developed to determine, at a suitable level of confidence,
the probability that X > Y. It was specifically developed to
evaluate the reliability of a component based upon its strength
and the stresses it must undergo.

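(1) The confidence limit statement:

$P_r\{\Phi[V - \Phi^{-1}(1-\alpha/2)\,\hat{\sigma}_V] < R < \Phi[V + \Phi^{-1}(1-\alpha/2)\,\hat{\sigma}_V]\} = 1 - \alpha$    F(5)


(2) $V = \dfrac{\bar{X} - \bar{Y}}{\sqrt{C_n S_x^2 + C_m S_y^2}}$ ; $\Phi(V) = r$ (the point estimate)    F(6)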


(3) 


2 - 


c s *♦  c s * 

n x my 


■>  « 

/V'V  . Cm‘-Sv'\ 

v , v . iwnr  ~rr) 


n 


J(Cnsx’*Cn,V>! 


F(7) 


b. The following explanations of F(5) through F(7) are
given:

(1) F(5) is for two-sided confidence limits. For one-sided,
replace $\alpha/2$ with $\alpha$.

(2) $\Phi$ refers to the cumulative normal and $\Phi^{-1}$ to the
inverse cumulative normal. Selected values of $\Phi^{-1}$ are provided
in Appendix 3.

(3) $C_n$ is a constant depending upon $n_x$, and $C_m$ on $n_y$. These
constants were developed by Dr. F. Downton, Reference e, and may
be found in Appendix 3. The constants could be replaced by one
without greatly affecting the results.

c. To solve Example A, Appendix 1, by the CHD Method:

(1) $V = 5.01922$ and $\hat{\sigma}_V = \sqrt{1.0754} = 1.037$.

(2) For 90%, one-sided confidence limits, $\Phi^{-1}(1-\alpha) = 1.28155$.

(3) $P_r\{\Phi[5.019 - 1.282 \times 1.037] < R\} = 90\%$

$P_r\{\Phi(3.690) < R\} = 90\%$

$P_r\{.999888 < R\} = 90\%$

90% confidence that the probability of case rupture < $1.12 \times 10^{-4}$.
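
The last step of this example, evaluating the cumulative normal at the
lower limit, can be reproduced without tables. The sketch below takes V
and its standard deviation as given (5.01922 and 1.037, the values
quoted above) and does not attempt to reproduce F(6) or F(7); the
function names are hypothetical.

```python
import math


def normal_cdf(z):
    """Cumulative standard normal, written with erfc for tail accuracy."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))


def chd_lower_bound(v, sigma_v, z_conf):
    """One-sided lower confidence limit on R = P(X > Y) per F(5)."""
    return normal_cdf(v - z_conf * sigma_v)


# 90% one-sided limit for Example A, using the quoted V and sigma
r_lower = chd_lower_bound(v=5.01922, sigma_v=1.037, z_conf=1.28155)
p_rupture_upper = 1.0 - r_lower   # upper limit on the probability of rupture
```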

d.  Advantages. 

(1)  At  a given  level  of  confidence,  low  probabilities  of 
case  rupture  can  be  obtained  with  relatively  small  samples. 

The CHD method generally appears to provide narrower confidence
limits than the tolerance limit method. Paragraph 2 of Appendix
1 gives the results for five examples and it can be seen that
in each case the CHD method gives probabilities ranging from 1/2
to 1/3 of those obtained by the tolerance limit method.

(2) Aside from the complexity of Formula F(7), the CHD
method is relatively simple to use. About the only tables
required are tables of C and good tables of the cumulative
normal.

e. Disadvantages.

(1) The CHD method is sensitive to deviations from
normality.

(2)  The  CHD  method  uses  the  asymptotic  normal  approximation 
of  a given  statistic,  and  requires  substitution  of  the  population 
means  and  standard  deviations  by  their  observed  sample  values. 

For  these  reasons,  the  method  of  statistical  tolerance  limits 
may  be  preferable  when  dealing  with  small  samples. 

4.  BIRNBAUM  - McCARTY  STATISTICS. 

a. Birnbaum - McCarty statistics provide a non-parametric
procedure for bounding, at a given level of confidence, the
probability $p$ that X<Y. This procedure is relatively simple and involves computing
$\hat{p} = U/(n_x n_y)$, where U is the number of pairs of x and y for
which x<y. It is then possible to make the following statement:
$P_r\{p < \hat{p} + \epsilon\} \geq \gamma$, where $\epsilon$ depends upon $n_x$, $n_y$, and $\gamma$, and can be
obtained from the Tables on pages 323 and 324 of Reference f.

b. To illustrate this procedure, Example A of Appendix 1
will again be used. It will be assumed that the smallest x in
the sample is larger than the largest y, resulting in $U = \hat{p} = 0$.
From page 323 of Reference f, it can be seen that $\epsilon = 0.609$ for
$n_x = 10$, $n_y = 9$, and $\gamma = 90\%$. Thus, $P_r[P(X<Y) \leq .609] \geq 90\%$,
which is to say that at the 90% level of confidence, the
probability of case rupture is less than 60.9%. (Note Para 2,
Appendix 1, for other examples.)
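
The counting step of the procedure is simple enough to sketch directly.
The function name and the sample values below are hypothetical and
chosen only so that the two samples do not overlap; the value 0.609 is
the tabulated ε quoted above and is not computed here.

```python
def birnbaum_mccarty_estimate(xs, ys):
    """Return (U, p_hat): the count of pairs with x < y and U / (n_x * n_y)."""
    u = sum(1 for x in xs for y in ys if x < y)
    return u, u / (len(xs) * len(ys))


# Hypothetical, non-overlapping samples (psi): every burst pressure
# exceeds every maximum pressure, so U = 0.
burst = [5600.0, 5900.0, 6300.0]
max_pressure = [3300.0, 3500.0, 3800.0]
u, p_hat = birnbaum_mccarty_estimate(burst, max_pressure)

epsilon = 0.609                  # from the table in Reference f, as quoted
upper_limit = p_hat + epsilon    # 90% upper limit on P(X < Y)
```

Because U is zero whenever the samples do not overlap, the upper limit
equals ε no matter how far apart the two samples lie, which is the
weakness noted under Disadvantages.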

c. It is of further interest that if Birnbaum - McCarty
statistics were used to verify, at the 90% level of confidence,
that the probability of case rupture did not exceed .005, it
would require $n_x = n_y = 140,111$.

d. Advantages. The only requirement is that X and Y must
be independent random variables.

e.  Disadvantages. 

(1)  To  provide  probabilities  which  are  at  all  useful, 
completely  unrealistic  sample  sizes  are  required. 

(2) Regardless of how widely separated $\bar{X}$ and $\bar{Y}$ may be, as
long as the two samples do not overlap, it does not improve
the probabilities.

5.  THE  CHEBYCHEFF  INEQUALITY. 

a. Since Birnbaum - McCarty statistics do not appear to
provide a reasonable solution for the motor case problem, the
Chebycheff Inequality is offered as a possible procedure when
there is reason to believe that the assumption of normality may
not be valid.

b. This procedure is discussed below, and Example A is
again used to illustrate the method.

(1) The Chebycheff Inequality is given by F(8) and F(9)
below. The procedure for converting the inequality from a
well known form to F(8) is provided by Appendix IV.

$P(X>Y) \geq 1 - 1/K^2$    F(8)

$K = \dfrac{\mu_x - \mu_y}{\sigma_{x-y}}$    F(9)


(2) Using the data for Example A from Appendix 1, and the
values $n_{x-y} = 9.22$ and $f = 16.31$ from Para 2c, the following
is determined:

(a) 95% LCL for $\mu_x - \mu_y$ (from the t distribution): 2356.642

(b) 95% UCL for $\sigma_{x-y}$:

$\sqrt{f S_{x-y}^2 / \chi^2_{.95,f}} = \sqrt{16.308 \times 265{,}000 / 8.181} = 726.809$

(c) 90% LCL for $K = 2356.642/726.809 = 3.242$

(d) 90% LCL for Prob (Y<X) $= 1 - 1/K^2 = 1 - 1/(3.242)^2 =
1 - .0951 = .905$.

(e) Assuming (X-Y) is continuous, unimodal, and symmetric,
90% LCL for Prob (Y<X) $= 1 - 2/(9K^2) = 1 - .0211 = .979$
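
Steps (c) through (e) amount to a few lines of arithmetic, sketched
below. The two confidence limits are taken as quoted (2356.642 and
726.809) and are not recomputed; the function name is hypothetical.

```python
def chebycheff_bounds(mu_diff_lcl, sigma_ucl):
    """Lower limits on P(Y < X) from F(8), F(9), and the 2/9 refinement."""
    k = mu_diff_lcl / sigma_ucl                  # F(9), conservative K
    general = 1.0 - 1.0 / k**2                   # F(8), distribution free
    symmetric = 1.0 - 2.0 / (9.0 * k**2)         # continuous, unimodal, symmetric
    return k, general, symmetric


k, general, symmetric = chebycheff_bounds(2356.642, 726.809)
```

Combining a 95% lower limit on the mean difference with a 95% upper
limit on the standard deviation is what gives the 90% level attached to
K and to both probability limits.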


c.  Advantages. 

(1) Except for the application of t and $\chi^2$ tests, the
procedure is completely distribution free.

(2) The procedure is relatively simple to apply.

(3) Referring to the Table in Para 2 of Appendix 1, it
appears that the Chebycheff procedure provides better results
than Birnbaum - McCarty, and if (X-Y) is continuous, unimodal,
and symmetric, it is possible to improve even more.

d. Disadvantages.

(1) The t and $\chi^2$ procedures are based upon the assumption
of normality.

(2) While the results are better than Birnbaum - McCarty,
they are still not adequate for most applications.

(3) If it is desired to use the further refinement of
continuity, unimodality, and symmetry, this will be about as
difficult to verify as normality, and it really doesn't buy
nearly as much as the assumption of normality.

6. OTHER PROCEDURES. The four methods for determining motor
safety, which are discussed in this report, are far from
exhaustive. Three additional methods are briefly discussed
below:

a. Reference m uses the Chebycheff inequality and the
Van Dantzig upper bound for the variance of the Mann-Whitney
statistic U to provide distribution free confidence intervals
for the probability (Y<X). This method generally gives better
results than Birnbaum-McCarty statistics but not as good as
the Chebycheff procedures of Section 5 of this report.

b. Paragraph 4 of Reference j provides a procedure which
can be used if X and Y are independent and both are normally
distributed. This method provides results which are about
the same as the Tolerance Limit Method, but has the following
limitations:

(1) $\sigma_x$ and $\sigma_y$ must be equal.

(2) $n_x$ and $n_y$ must be equal.

(3)  The  published  tables  do  not  go  below  10  or  above 
100  for  sample  size. 

(4)  The  published  tables  do  not  provide  probabilities 
of  less  than  one  percent. 

c. Reference 1 provides procedures based upon similar
concepts to those discussed in 6b and provides some additional
methods for solving the case rupture problem.

c. Actual Test Data.


(1) A search was made for actual test data, and while
all the data that was located was classified, burst pressure
data (X) was located from three tests and maximum generated
pressure data (Y) was located from 20 tests. Table 1 provides:
sample size; the coefficient of skewness ($\gamma_1$); the coefficient
of kurtosis ($\gamma_2$); and the significance level using the
Shapiro-Wilk procedure to test for deviations from normality. (Note
Reference i for a discussion of the Shapiro-Wilk Test.)
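
For readers who wish to compute comparable shape statistics, a minimal
sketch follows. The report does not state which estimators were used,
so the simple moment forms below (third and fourth central moments
scaled by powers of the second, with 3 subtracted for kurtosis) are an
assumption, and the function name is hypothetical. Small-sample
corrected estimators would give somewhat different values.

```python
def skewness_kurtosis(sample):
    """Moment coefficients of skewness and (excess) kurtosis."""
    n = len(sample)
    mean = sum(sample) / n
    m2 = sum((v - mean) ** 2 for v in sample) / n   # second central moment
    m3 = sum((v - mean) ** 3 for v in sample) / n   # third central moment
    m4 = sum((v - mean) ** 4 for v in sample) / n   # fourth central moment
    gamma1 = m3 / m2 ** 1.5
    gamma2 = m4 / m2 ** 2 - 3.0    # zero for a normal distribution
    return gamma1, gamma2


# A symmetric, flat-topped hypothetical sample: zero skewness and
# negative kurtosis.
g1, g2 = skewness_kurtosis([1.0, 2.0, 3.0, 4.0, 5.0])
```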


(2) From Table 1:

(a) Burst Pressure Data.

1. Neither $\gamma_1$ nor $\gamma_2$ is significantly different from zero
for any of the three tests.

2. $\gamma_1$ is small in all cases and positive for two.

3. $\gamma_2$ is negative for all three tests.

4. The Shapiro-Wilk test gives no indication of deviation
from normality.

5. For burst pressure, the assumption of normality appears
reasonable.

(b) Maximum Generated Pressure.

1. For Test #5, $\gamma_1$ is significantly different from zero
at the 5% level and $\gamma_2$ at the 1% level. Neither $\gamma_1$ nor $\gamma_2$ is
significantly different from zero for any of the other 19 tests.

2. From the Shapiro-Wilk Test, #5 deviated significantly
from normality at the 1% level while tests 11 and 14 showed
deviation at the 10% level. For the other 17 tests, there was
no indication of a significant deviation.

3. The signs for $\gamma_1$ and $\gamma_2$ are about evenly divided between
positive and negative.

4. There is little indication that (Y) data deviates
significantly from normality, but there is enough questionable
data that further study appears desirable.

TABLE 1


PROPERTIES OF 3 SAMPLES FOR BURST PRESSURE (X)
AND 20 SAMPLES FOR MAXIMUM GENERATED PRESSURE (Y)

TYPE OF TEST           TEST   SAMPLE                             SHAPIRO-
                       NO.    SIZE     $\gamma_1$     $\gamma_2$     WILK TEST

BURST PRESSURE (X)      1       7      -0.070      -1.193      .90
                        2      10       0.162      -0.284      .50
                        3      10       0.118      -0.505      .98

MAX GEN PRESSURE (Y)    1       7      -0.86        0.723      .27
                        2       8      -0.17        0.162      .61
                        3      12       0.088      -1.124      .27
                        4      12       0.681       0.245      .40
                        5      10      -1.849**     5.263***   .01***
                        6      12       0.285      -1.330      .35
                        7      10      -0.394      -1.502      .27
                        8       4       0.840       2.289      .36
                        9       5       0.992       2.861      .32
                       10       6       1.053       2.727      .34
                       11       5       0.405      -3.034      .09*
                       12       6       0.145      -0.908      .98
                       13       6       0.447       0.454      .59
                       14       4       1.047       3.484      .08*
                       15       4       0.317      -0.849      .81
                       16       4      -0.771       1.916      .42
                       17       4       0.000       1.500      .57
                       18       8      -0.331       1.634      .75
                       19       6       0.088      -1.900      .57
                       20       6      -0.105      -1.900      .25

*Significance at 10% level.
**Significance at 5% level.
***Significance at 1% level.


(3) The small samples associated with the tests in Table 1
provide a serious handicap in evaluating normality. Therefore,
the following was performed in an attempt to obtain a look at a
larger sample:

(a) The data from the three "burst pressure" samples were
transformed to provide samples with $\bar{X} = 0$ and $S_x = 1$. The data
was then combined to form a single sample with $n = 27$, $\bar{X} = 0$,
and $S_x = 1$.

(b) The data from the 20 "maximum generated pressure"
samples were transformed to provide samples with $\bar{Y} = 0$ and
$S_y = 1$. The data was then combined to form a single sample
with $n = 139$, $\bar{Y} = 0$, and $S_y = 1$.

(c) The data discussed in Para (a) was again transformed
such that $\bar{X} = 4000$ and $S_x = 300$. The data in Para (b) was
transformed to have $\bar{Y} = 2000$ and $S_y = 75$. Then a sample of 200
values of X-Y was obtained by randomly matching values of X
and Y and computing their differences.

(d) The sample size, mean, standard deviation, coefficient
of skewness, coefficient of kurtosis, their standard deviations,
and normality test information are provided in Table 2 for the
combined samples of X, Y, and (X-Y).
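
The pooling and matching steps in (a) through (c) can be sketched as
follows. The function names, the seed, and the tiny input samples are
hypothetical (the real data were classified), and the use of the n - 1
sample standard deviation and of sampling with replacement when matching
are assumptions, since the report does not say which were used.

```python
import random
import statistics


def rescale(sample, target_mean, target_sd):
    """Linearly transform a sample to a given mean and standard deviation."""
    mean = statistics.mean(sample)
    sd = statistics.stdev(sample)          # n - 1 divisor (assumed)
    return [target_mean + target_sd * (v - mean) / sd for v in sample]


def pool_standardized(samples):
    """Standardize each sample to mean 0, sd 1, then combine them."""
    return [v for sample in samples for v in rescale(sample, 0.0, 1.0)]


def matched_differences(xs, ys, count, seed=0):
    """Randomly pair values of X and Y and return the differences X - Y."""
    rng = random.Random(seed)
    return [rng.choice(xs) - rng.choice(ys) for _ in range(count)]


# Hypothetical stand-ins for the burst and maximum pressure samples.
x_pool = pool_standardized([[5.1, 5.9, 6.4, 6.0], [4.8, 5.5, 6.1]])
y_pool = pool_standardized([[3.4, 3.5, 3.7], [3.2, 3.6, 3.5, 3.9]])
x_sim = rescale(x_pool, 4000.0, 300.0)
y_sim = rescale(y_pool, 2000.0, 75.0)
diffs = matched_differences(x_sim, y_sim, count=200)
```

Because a pooled sample built this way draws its values from only a
limited set of distinct observations, shape statistics of the matched
differences may partly reflect the construction itself, a caution the
report raises when discussing Table 2.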

TABLE 2


PROPERTIES OF SIMULATED SAMPLES FOR BURST PRESSURE (X),
MAXIMUM GENERATED PRESSURE (Y), AND (X-Y)

TYPE OF   SAMPLE            ST.                                                      TEST FOR
DATA      SIZE     MEAN     DEV.     $\gamma_1$    $s_{\gamma_1}$    $\gamma_2$      $s_{\gamma_2}$    NORMALITY*

X          27      .0002    0.961     .0887    .471    -.0811      .578    .43
Y         139      .0008    0.923    -.124     .208    -.568       .408    .11
X-Y       200      2026     287      -.004     .173    -.978***    .342    .025**

**Significant at the 5% level.
***Significant at the 1% level.

*The Shapiro-Wilk Test was used to test for normality of X.
The $\chi^2$ test was used to test for normality of Y and X-Y.

(4) It is possible that the procedures introduced in
developing the samples of Table 2 may have introduced some
biases in the above statistics, but the following may be
drawn from this Table.

(a) Burst Pressure Data (X)

1. Both $\gamma_1$ and $\gamma_2$ are small and have the desired sign.

2. The Shapiro-Wilk Test indicates no significant
deviation from normality.

3. The assumption of normality appears to be valid.

(b) Maximum Generated Pressure (Y)

1. $\gamma_1$ is small and possesses the desired sign.

2. $\gamma_2$ differs from zero at about the 17% level of
significance. $\gamma_2$ possesses the desired sign if not zero.

3. Y deviates from normality at the 11% level of
significance, using the $\chi^2$ test for goodness of fit.

4. While $\gamma_1$ and $\gamma_2$ possess the desired signs, there
remains some concern about the assumption of normality and
the behavior of the distribution in the region of the tails.
Fortunately, the variance of Y is usually significantly
smaller than the variance for X.

(c) (X-Y)

1. $\gamma_1$ is essentially zero.

2. $\gamma_2$ is large and differs from zero at the 1% level of
significance. This may reflect the procedure used instead of
the behavior of the distribution of (X-Y). Fortunately, $\gamma_2$ is
negative.

3. Using the $\chi^2$ test for goodness of fit, (X-Y) deviates
from normality at the 2 1/2% level of significance. This can
be attributed to the large negative value of $\gamma_2$.

4. There may be considerable question concerning the
normality for (X-Y), especially since normality for Y is
questionable. This apparent lack of normality appears to be
caused by the large, negative value for the coefficient of
kurtosis, and this may be caused by the procedure for
constructing the sample rather than the actual behavior of
(X-Y). While a negative value for the coefficient of kurtosis
may be preferable to a positive value, there still remains
the question of the behavior of the distribution in the
vicinity of the tails.

8.  CONCLUSIONS. 


a.  Birnbaum-McCarty  and  Chebycheff  Inequality  procedures 
are  desirable  because  of  their  distribution  free  characteristics. 
However,  each  provides  either  unsatisfactory  confidence  limits 

or  requires  unrealistic  sample  sizes. 

b.  Both  the  statistical  tolerance  limits  and  the  Church- 
Harris-Downton  methods  require  assumptions  of  independence 
for  X and  Y and  normality  for  each.  If  the  above  assumptions 
are  valid,  either  can  be  expected  to  provide  satisfactory 
confidence  limits  with  a reasonable  sample.  Of  the  two,  the 
Church-Harris-Downton  method  appears  to  provide  narrower 
confidence  limits,  but  may  be  less  suitable  for  small  samples. 

c. A study of actual data suggests:

(1) The assumption of normality appears to be reasonable
for pressure required to burst the case (X).

(2) The assumption of normality may be questionable for
the maximum pressure (Y) and the difference (X-Y), thus suggesting
considerable caution when applying these procedures.

APPENDIX 1


EXAMPLES 


1.  The  following  examples  are  used  for  illustrative  and 
for  comparison  purposes.  Example  A is  used  for  demonstrating 
how  the  four  procedures  of  this  report  are  applied.  Note 
that  for  Example  A,  a designed  experiment  was  used  to  evaluate 
the  maximum  pressure  exerted  by  the  propellant,  resulting  in 
a sample  of  10  with  17  degrees  of  freedom.  In  all  other 
examples,  the  degrees  of  freedom  are  (n-1). 


             PRESSURE REQUIRED            MAXIMUM PRESSURE
             TO BURST CASE                EXERTED BY PROPELLANT

Example A    $\bar{X}$ = 6000 psi               $\bar{Y}$ = 3500 psi
             $S_x$ = 450 psi                $S_y$ = 250 psi
             $n_x$ = 9                      $n_y$ = 10
             $f_x$ = 8                      $f_y$ = 17

Example B    $\bar{X}$ = 6000 psi               $\bar{Y}$ = 3500 psi
             $S_x$ = 400 psi                $S_y$ = 400 psi
             $n_x$ = 16                     $n_y$ = 15
             $f_x$ = 15                     $f_y$ = 14

Example C    $\bar{X}$ = 6000 psi               $\bar{Y}$ = 3500 psi
             $S_x$ = 400 psi                $S_y$ = 200 psi
             $n_x$ = 10                     $n_y$ = 25
             $f_x$ = 9                      $f_y$ = 24

Example D    $\bar{X}$ = 6000 psi               $\bar{Y}$ = 3500 psi
             $S_x$ = 450 psi                $S_y$ = 250 psi
             $n_x$ = 25                     $n_y$ = 10
             $f_x$ = 24                     $f_y$ = 9

Example E    $\bar{X}$ = 6000 psi               $\bar{Y}$ = 3500 psi
             $S_x$ = 500 psi                $S_y$ = 125 psi
             $n_x$ = 8                      $n_y$ = 8
             $f_x$ = 7                      $f_y$ = 7

2. The following table provides 90%, one-sided confidence limits that the probability of case rupture will not exceed the listed value for each procedure.


PROCEDURE

            STATISTICAL    CHURCH-                                       2/9 x
            TOLERANCE      HARRIS-        BIRNBAUM-      CHEBYCHEFF      CHEBYCHEFF
Example     LIMITS         DOWNTON        McCARTY(1)     INEQUALITY      INEQUALITY(2)
A           3.30 x 10^-?   1.12 x 10^-?   6.09 x 10^-?    9.51 x 10^-?   2.11 x 10^-?
B           2.92 x 10^-?   1.10 x 10^-?   4.83 x 10^-?   10.24 x 10^-2   2.28 x 10^-2
C           2.91 x 10^-?   8.54 x 10^-?   4.88 x 10^-?    8.10 x 10^-2   1.80 x 10^-2
D           5.05 x 10^-?   2.00 x 10^-?   4.88 x 10^-1    7.86 x 10^-?   1.75 x 10^-?
E           9.18 x 10^-?   4.28 x 10^-?   6.62 x 10^-1   16.46 x 10^-?   3.66 x 10^-?

(An exponent shown as ? is illegible in this copy.)

(1) When applying the Birnbaum-McCarty procedure, it is assumed that the smallest X in the sample is larger than the largest Y in its sample.

(2) Using 2/9 of the Chebycheff values is justified if (X-Y) is continuous, unimodal, and symmetric.
SELECTING THE SAMPLE SIZE FOR THE STATISTICAL TOLERANCE LIMIT METHOD


1. When applying statistical tolerance limits to determine the probability that X-Y > 0, it is necessary to determine a sample size, $n_{x-y}$, to be used in the computation. If $n_x = n_y$, then simply set $n_{x-y} = n_x = n_y$. If $n_x \ne n_y$, a safe procedure is to set $n_{x-y}$ to the smaller of $(n_x, n_y)$. However, after some consideration, F(3) was decided upon:

$$n_{x-y} = \frac{S_x^2 + S_y^2}{\dfrac{S_x^2}{n_x} + \dfrac{S_y^2}{n_y}} \qquad F(3)$$

2. The procedure used in determining F(3) was as follows:

a. The t test for the equality of two means with unequal variances:

$$t = \frac{(\bar{X} - \bar{Y}) - (\mu_x - \mu_y)}{\sqrt{\dfrac{S_x^2}{n_x} + \dfrac{S_y^2}{n_y}}}$$

b. If $n_x = n_y = n$, the formula obviously becomes:

$$t = \frac{(\bar{X} - \bar{Y}) - (\mu_x - \mu_y)}{\sqrt{\dfrac{S_x^2 + S_y^2}{n}}}$$

c. Equating the two and solving for n gives F(3).

3. The above procedure cannot be considered more than a plausible reason for F(3); however, F(3) does have the following desirable attributes:

a. If $n_x = n_y$, then $n_{x-y} = n_x = n_y$.

b. If $S_x = S_y$, then $n_{x-y}$ is the harmonic mean of $n_x$ and $n_y$.

c. $n_{x-y}$ is bounded by $n_x$ and $n_y$.

d. If $S_x > S_y$, then $n_{x-y}$ will be closer to $n_x$ than $n_y$, and this is desirable since the larger S has the greater influence on $S_{x-y}$ in the formula: $S_{x-y} = \sqrt{S_x^2 + S_y^2}$.
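The attributes listed above can be checked numerically. The following is a minimal sketch of F(3) in Python; the function name `effective_sample_size` is an illustrative choice, not something defined in the report.

```python
def effective_sample_size(sx, sy, nx, ny):
    """Sample size n_{x-y} from F(3): (Sx^2 + Sy^2) / (Sx^2/nx + Sy^2/ny)."""
    return (sx**2 + sy**2) / (sx**2 / nx + sy**2 / ny)


# Example A of Appendix 1: Sx = 450, Sy = 250, nx = 9, ny = 10
n_a = effective_sample_size(450, 250, 9, 10)

# Equal standard deviations reduce F(3) to the harmonic mean of nx and ny
n_equal_s = effective_sample_size(400, 400, 16, 15)
harmonic = 2 / (1 / 16 + 1 / 15)
```

For Example A the value lies between 9 and 10 and is closer to $n_x$ = 9, as attribute d predicts, because $S_x > S_y$.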


APPENDIX  3 

TABLES REQUIRED FOR THE CHURCH-HARRIS-DOWNTON (CHD) PROCEDURE

1. Values of $c_n$, taken from p. 554 of Reference e, are listed below.(1)


 n     c_n         n     c_n         n     c_n

 4    0.9955      17    0.9560      30    0.9719
 5    0.9456      18    0.9578      31    0.9727
 6    0.9334      19    0.9595      32    0.9734
 7    0.9314      20    0.9610      33    0.9741
 8    0.9328      21    0.9625      34    0.9748
 9    0.9355      22    0.9638      35    0.9754
10    0.9384      23    0.9651      36    0.9760
11    0.9414      24    0.9663      37    0.9766
12    0.9443      25    0.9674      38    0.9772
13    0.9470      26    0.9684      39    0.9777
14    0.9495      27    0.9694      40    0.9782
15    0.9519      28    0.9703       -      -
16    0.9540      29    0.9711       -      -

For large n, $c_n \approx 1 - 1/n + 6/n^2 - 60/n^3 + O(n^{-4})$.

(1) This Table has been included with the permission of the author.
2. Values for $\Phi^{-1}$ (Inverse Cumulative Normal)


CONFIDENCE      ONE        TWO
LEVEL           SIDED      SIDED

0.70            0.5244     1.0364
0.80            0.8416     1.2816
0.85            1.0364     1.4395
0.90            1.2816     1.6449
0.95            1.6449     1.9600
0.975           1.9600     2.2414
0.99            2.3264     2.5758
0.995           2.5758     2.8070
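Entries of this kind can be regenerated rather than looked up. The sketch below uses Python's standard-library `statistics.NormalDist`; the helper names `one_sided` and `two_sided` are illustrative, and the two-sided value is taken as $\Phi^{-1}((1+\gamma)/2)$ for confidence level $\gamma$, which matches the legible rows of the table.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def one_sided(conf_level):
    """Inverse cumulative normal at the confidence level itself."""
    return _STD_NORMAL.inv_cdf(conf_level)


def two_sided(conf_level):
    """Inverse cumulative normal with the tail area split over both sides."""
    return _STD_NORMAL.inv_cdf((1 + conf_level) / 2)


table = [(g, round(one_sided(g), 4), round(two_sided(g), 4))
         for g in (0.70, 0.80, 0.85, 0.90, 0.95, 0.975, 0.99, 0.995)]
```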

3. For values of $\Phi$, any table of the cumulative normal should be adequate. The tables on p. 3-1 Q and on p. 13 of Reference f could be useful since these cover values of $\Phi$ from 0-500.

4. Using the above, V and $\sigma_V$ can easily be computed; note Example A, Appendix 1.

a. $$V = \frac{\bar{X} - \bar{Y}}{\sqrt{c_n S_x^2 + c_m S_y^2}} = \frac{6000 - 3500}{\sqrt{0.9355(450)^2 + 0.9384(250)^2}} = 5.01922 \qquad F(6)$$

b. $$\sigma_V^2 = 1.0817, \qquad \sigma_V = 1.0400$$

(The intermediate expression for $\sigma_V^2$ is not legible in this copy; only its numerical result is recoverable.)
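The value of V in Example A can be reproduced directly from the table of $c_n$ values. This is a minimal sketch: the dictionary holds only the two tabulated entries needed here, and the function name `chd_v` is illustrative rather than part of the report.

```python
import math

# Subset of the c_n table in paragraph 1 (n = 9 and n = 10 only)
C_N = {9: 0.9355, 10: 0.9384}


def chd_v(xbar, ybar, sx, sy, nx, ny, c_table=C_N):
    """V = (Xbar - Ybar) / sqrt(c_nx * Sx^2 + c_ny * Sy^2), as in F(6)."""
    denom = math.sqrt(c_table[nx] * sx**2 + c_table[ny] * sy**2)
    return (xbar - ybar) / denom


v_example_a = chd_v(6000, 3500, 450, 250, 9, 10)
```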
APPENDIX 4

THE CHEBYCHEFF INEQUALITY


1. The usual statement of the Chebycheff Inequality:

a. $P[\,|Z - \mu_z| > b\,] < \sigma_z^2 / b^2$

b. $P[\,-b < (Z - \mu_z) < b\,] > 1 - \sigma_z^2 / b^2$

c. $P[\,(Z - \mu_z) > -b\,] > 1 - \sigma_z^2 / b^2$

Since the inequality has changed from two-sided to one-sided, it would appear reasonable to write the right side of 1c as $1 - \sigma_z^2 / 2b^2$; however, this does not appear to be justified since there is no assurance of symmetry.

2. Letting Z = X-Y, 1c can be written:

a. $$P\{[(X-Y) - (\mu_x - \mu_y)] > -(\mu_x - \mu_y)\} > 1 - \frac{\sigma_{x-y}^2}{(\mu_x - \mu_y)^2}$$

b. Adding $(\mu_x - \mu_y)$ to both sides of the inequality within the brackets and letting

$$K = \frac{\mu_x - \mu_y}{\sigma_{x-y}} \qquad F(8)$$

gives $P[(X-Y) > 0] > 1 - 1/K^2$

c. $P(X > Y) > 1 - 1/K^2$ or

d. $P(X < Y) < 1/K^2 \qquad F(9)$

3. According to Reference k, page 293, when (X-Y) is continuous, unimodal, and symmetric, and since this is a one-sided inequality, 2d could be written:

$$P(X < Y) < \frac{2}{9K^2}$$
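The two bounds are simple functions of K. The sketch below evaluates them for a plug-in K computed from sample means and standard deviations; this plug-in is an assumption made for illustration only, so the output is not one of the confidence limits tabulated in Appendix 1. The function names are illustrative.

```python
import math


def chebycheff_bounds(k):
    """Return (1/K^2, 2/(9 K^2)): the bounds of 2d and of paragraph 3."""
    plain = 1.0 / k**2
    return plain, 2.0 * plain / 9.0


def plug_in_k(xbar, ybar, sx, sy):
    """Illustrative K using sample values in place of the population ones."""
    return (xbar - ybar) / math.sqrt(sx**2 + sy**2)


k = plug_in_k(6000, 3500, 450, 250)
bound, bound_unimodal = chebycheff_bounds(k)
```

The second bound is always 2/9 of the first, which is the relation between the last two columns of the table in Appendix 1.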
ON THE NONEXISTENCE OF SOME
INCOMPLETE BLOCK DESIGNS

Alan  V.  Ban ton 

U. S. Army Materiel Systems Analysis Activity
Aberdeen Proving Ground, Maryland


ABSTRACT. When considering a randomized complete block design, it may turn out that the blocks available are not large enough to accommodate all of the treatments. We are, thus, naturally led to the consideration of incomplete block designs (IBD); incomplete in the sense that each block does not contain a complete set of treatments. Although the parameters which define an IBD may satisfy the necessary parametric relations usually used for this purpose, the configuration may not exist. A development of nonexistence proofs, utilizing the Hasse-Minkowski invariant, is presented which leads to some necessary conditions for symmetrical balanced incomplete block designs (SBIBD). Some necessary conditions are worked out for the existence of intra- and inter-group balanced incomplete block designs.

1. INTRODUCTION. In order to pave the background for the formulation of the problem it will be necessary to provide a few definitions.

Let v denote the number of treatments,
b denote the number of blocks,
r denote the number of replications of a treatment,
k denote the block size, i.e., the number of treatments in a block.

An IBD is an arrangement of v treatments in b blocks such that no treatment occurs more than once in any block, each treatment occurs in exactly r blocks and each block contains exactly k distinct treatments, k < v. A BIBD is characterized by the parameter $\lambda$ which indicates the number of times a pair of treatments occurs together in a block. If v = b then the design is said to be symmetric and this design is denoted by SBIBD.

The  five  parameters  which  define  a BIBD  are  not  algebraically 
independent.  They  are  integers  subject  to  the  following  restrictions: 

$$vr = bk \qquad (1)$$

$$\lambda(v-1) = r(k-1) \qquad (2)$$
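Relations (1) and (2) are easy to screen for mechanically before any deeper existence argument is attempted. A minimal sketch follows; the function name is illustrative. Passing the check is necessary but not sufficient for the design to exist.

```python
def satisfies_bibd_relations(v, b, r, k, lam):
    """Check the necessary relations vr = bk and lam(v-1) = r(k-1)."""
    return v * r == b * k and lam * (v - 1) == r * (k - 1)


# A small symmetric design and a parameter set that fails relation (2)
ok_small = satisfies_bibd_relations(3, 3, 2, 2, 1)
bad_set = satisfies_bibd_relations(7, 7, 3, 3, 2)
```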
With every design is associated a unique (0,1) matrix called the incidence matrix, where 1 and 0 indicate the presence or absence of a treatment in a block, respectively. The matrix for a BIBD is written as

$$N = (n_{ij}), \quad i = 1, 2, \ldots, v \text{ and } j = 1, 2, \ldots, b$$

$$n_{ij} = \begin{cases} 1 & \text{if } V_i \in B_j \\ 0 & \text{if } V_i \notin B_j, \end{cases}$$

where $V_1, V_2, \ldots, V_v$ are the treatments and $B_1, B_2, \ldots, B_b$ are the blocks.


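Of considerable interest in the theory of IBDs is the matrix NN', which consists of v rows and v columns and provides a description of the treatment structure of the design. For example, for the design v = b = 3, r = k = 2, $\lambda$ = 1

$$N = \begin{pmatrix} 1 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & 1 \end{pmatrix}, \qquad NN' = \begin{pmatrix} 2 & 1 & 1 \\ 1 & 2 & 1 \\ 1 & 1 & 2 \end{pmatrix} = (r-\lambda) I_v + \lambda J_{vv} \qquad (3)$$

where $I_v$ is the identity matrix of order v and $J_{vv}$ a matrix of order v x v all of whose elements are 1.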
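The structure of NN' for the small design v = b = 3, r = k = 2, $\lambda$ = 1 can be confirmed with a few lines of plain Python. The incidence matrix used below is one valid labeling of that design, chosen for illustration.

```python
def times_transpose(n_matrix):
    """Return N N' for a matrix given as a list of rows."""
    return [[sum(a * b for a, b in zip(row_i, row_j)) for row_j in n_matrix]
            for row_i in n_matrix]


def expected_form(v, r, lam):
    """Return (r - lam) I_v + lam J_vv as a list of rows."""
    return [[r if i == j else lam for j in range(v)] for i in range(v)]


N = [[1, 1, 0],
     [1, 0, 1],
     [0, 1, 1]]
NNt = times_transpose(N)
```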

In the remainder of the text we will make use of the properties of the Legendre symbol (b/p), the Hilbert norm residue symbol $(a,b)_p$, and the Hasse-Minkowski invariant of a matrix A, $c_p(A)$. Shrikhande [8] and Chowla and Ryser [2] were the first to use these as the main tools in nonexistence theory. Only the definitions will be provided; properties and proofs can be obtained from Uspensky and Heaslet [9], Jones [4] and Pall [7].

Let p be a prime. If p does not divide b, and $x^2 \equiv b \pmod p$ has a solution x (mod p), then b is a quadratic residue (QR) mod p; otherwise it is a quadratic nonresidue (QNR) mod p.

The property of QR and QNR may be expressed in terms of the Legendre symbol (b/p) by the rules

$$(b/p) = \begin{cases} +1 & \text{if } b \text{ is a QR} \\ -1 & \text{if } b \text{ is a QNR.} \end{cases}$$
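For an odd prime p the Legendre symbol can be evaluated with Euler's criterion, $b^{(p-1)/2} \equiv (b/p) \pmod p$. Euler's criterion is a standard result that is not stated in the paper; the sketch below uses it as a convenient computational shortcut, and the function name is illustrative.

```python
def legendre(b, p):
    """Legendre symbol (b/p) for an odd prime p that does not divide b."""
    if b % p == 0:
        raise ValueError("p must not divide b")
    # Euler's criterion: the power is 1 for a QR and p - 1 for a QNR
    return 1 if pow(b % p, (p - 1) // 2, p) == 1 else -1


# 2 is a QR mod 7 (3*3 = 9 = 2 mod 7); 3 is a QNR mod 7
symbols_mod_7 = {b: legendre(b, 7) for b in range(1, 7)}
```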


A generalization of the Legendre symbol is the Hilbert norm residue symbol $(a,b)_p$ which is +1 or -1 according as the congruence

$$ax^2 + by^2 \equiv 1 \pmod{p^{\alpha}}$$

has or has not, for each value $\alpha$, rational solutions x and y. p is any prime and a and b are rational numbers.


Two symmetric and nonsingular matrices A and B of the same order n, with rational elements, are rationally congruent if there exists a nonsingular and rational matrix C of the same order such that C'AC = B, where C' denotes the transposed matrix of C. This relationship is denoted by A ~ B. The symbol ~ will also be used to denote that the square free parts of two integers are the same.

Let $D_i$ (i = 1, 2, ..., n), with $D_n = |A|$, denote the leading principal minor determinants. Define $D_0 = 1$. Then for $D_i \ne 0$ the Hasse-Minkowski invariant of a matrix A is given by

$$c_p(A) = (-1,-1)_p \prod_{j=0}^{n-1} (D_{j+1}, -D_j)_p \qquad (4)$$

for every prime p and is invariant for all matrices rationally congruent to A.

A fundamental theorem on rational congruence due to H. Hasse [3] and one to which we shall appeal is

Theorem 1. Two symmetric and rational matrices, A and B, of the same order are rationally congruent if and only if |A| ~ |B|, index A = index B, and $c_p(A) = c_p(B)$ for all primes p.


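2. A NECESSARY CONDITION FOR THE EXISTENCE OF A SBIBD. From equation (3) we find that

$$NN' = \begin{pmatrix} r & \lambda & \cdots & \lambda \\ \lambda & r & \cdots & \lambda \\ \vdots & \vdots & \ddots & \vdots \\ \lambda & \lambda & \cdots & r \end{pmatrix} = (r-\lambda) I_v + \lambda J_{vv}$$

and that $\det(NN') = (r-\lambda)^{v-1}(\lambda(v-1) + r) = r^2 (r-\lambda)^{v-1}$.

It may be shown that $NN' \sim I_v$. Also, since the rank of N is v, index NN' = index $I_v$ = v. For a matrix of the form $A = eI_v + fJ_{vv}$, then (Ogawa [6])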

Cp(A)  - (-l-DpC-l.ajJ^'^^C.l.gjpCM.fJptn.aJpfg.ajJ'1 
where g = e + vf. Using this result we find

$$c_p(I_v) = (-1,-1)_p$$

$$c_p(NN') = (-1,-1)_p \, (-1, r-\lambda)_p^{v(v-1)/2} \, (v, r-\lambda)_p.$$

Hence, we obtain

Theorem 2. The necessary conditions for the existence of a SBIBD with parameters v, r, $\lambda$ are that

$$(r-\lambda)^{v-1} \sim 1$$

and if so, then

$$(-1, r-\lambda)_p^{v(v-1)/2} \, (v, r-\lambda)_p = +1 \qquad (5)$$

for all primes p.


The design with parameters v = b = 29, r = k = 8, $\lambda$ = 2 satisfies equations (1) and (2). Using the theorem, $(8-2)^{28} \sim 1$, but using (5)

$$(-1,6)_p^{29 \cdot 28/2} \, (29,6)_p = (29,6)_p = (29,3)_p (29,2)_p.$$

But for p = 3, $(29,3)_3 (29,2)_3 = -1$, which implies that the design does not exist.

... incomplete block designs for experiments involving several groups of treatments which are known as intra- and inter-group BIBDs (I-IBIBD). In such designs one is interested in achieving equal accuracy for comparisons between all pairs of treatments belonging to the same group.
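The nonexistence argument for the design v = b = 29, r = k = 8, $\lambda$ = 2 comes down to evaluating two Hilbert symbols at p = 3. The sketch below uses the standard closed form for odd primes, $(a,b)_p = (-1)^{\alpha\beta(p-1)/2} (u/p)^{\beta} (w/p)^{\alpha}$ with $a = p^{\alpha}u$ and $b = p^{\beta}w$. That formula is an assumption brought in from the general theory of the Hilbert symbol, not something stated in the paper, and the function names are illustrative. It handles nonzero integers and odd primes only.

```python
def _legendre(u, p):
    """Legendre symbol (u/p) for odd prime p not dividing u."""
    return 1 if pow(u % p, (p - 1) // 2, p) == 1 else -1


def _split(n, p):
    """Write n = p**alpha * u with p not dividing u; return (alpha, u)."""
    alpha = 0
    while n % p == 0:
        n //= p
        alpha += 1
    return alpha, n


def hilbert_odd(a, b, p):
    """Hilbert symbol (a,b)_p for nonzero integers a, b and odd prime p."""
    alpha, u = _split(a, p)
    beta, w = _split(b, p)
    sign = -1 if (alpha * beta * ((p - 1) // 2)) % 2 else 1
    return sign * _legendre(u, p) ** beta * _legendre(w, p) ** alpha


product_at_3 = hilbert_odd(29, 3, 3) * hilbert_odd(29, 2, 3)
```

A product of -1 at p = 3 violates condition (5), in agreement with the conclusion drawn in the text.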


An I-IBIBD is defined as follows:

(a) The experimental material is divided into b blocks of k units each, different treatments being applied to the units in the same block.

(b) There are m groups of treatments consisting of $v_1, v_2, \ldots, v_m$ treatments.

(c) Treatments belonging to the i-th group are replicated $r_i$ times, i = 1, 2, ..., m.

(d) Every pair of treatments in the i-th group occur together in $\lambda_{ii}$ blocks (i = 1, 2, ..., m), and every pair of treatments one of which belongs to the i-th group and the other to the j-th group occur together in $\lambda_{ij}$ blocks ($i \ne j$; i, j = 1, 2, ..., m).

The numbers $v_i$, b, k, $r_i$, $\lambda_{ij}$ (i, j = 1, 2, ..., m) are known as the parameters of the I-IBIBD m-group design. The parameters must first satisfy the following relations in order for the design to exist.


$$v = \sum_{i=1}^{m} v_i$$

$$bk = \sum_{i=1}^{m} v_i r_i$$

$$\lambda_{ii}(v_i - 1) + \sum_{j \ne i} \lambda_{ij} v_j = r_i (k-1), \quad i = 1, 2, \ldots, m.$$


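By arranging the treatments within a group in order and the groups of treatments in order we obtain

$$NN' = \begin{pmatrix} A_1 & B_{12} & \cdots & B_{1m} \\ B_{21} & A_2 & \cdots & B_{2m} \\ \vdots & \vdots & \ddots & \vdots \\ B_{m1} & B_{m2} & \cdots & A_m \end{pmatrix}$$

where $A_i = (r_i - \lambda_{ii}) I_{v_i} + \lambda_{ii} J_{v_i v_i}$ and $B_{ij} = \lambda_{ij} J_{v_i v_j}$. If in particular $r_i = r$, $\lambda_{ii} = \lambda_1$, $\lambda_{ij} = \lambda_2$, then the design reduces to a group divisible design.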

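Consider the case for m = 2. Then

$$NN' = \begin{pmatrix} A_1 & B_{12} \\ B_{21} & A_2 \end{pmatrix}.$$

Since $A_2$ is nonsingular, the determinant may be evaluated from

$$\det(NN') = \det(A_2) \det(A_1 - B_{12} A_2^{-1} B_{21}).$$

After some manipulations we find

$$\det(NN') = (r_2 - \lambda_{22})^{v_2 - 1} (r_1 - \lambda_{11})^{v_1 - 1} \{ (r_2 + \lambda_{22}(v_2 - 1)) (r_1 + \lambda_{11}(v_1 - 1)) - v_1 v_2 \lambda_{12}^2 \}.$$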
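The closed form for det(NN') in the two-group case can be spot-checked against a direct determinant. The sketch below builds the block matrix from its parameters and compares an exact Gaussian-elimination determinant (using `fractions.Fraction`) with the formula. The parameter values are arbitrary test numbers chosen for illustration and are not claimed to correspond to an actual design; all function names are illustrative.

```python
from fractions import Fraction


def build_nnt(v1, v2, r1, r2, l11, l22, l12):
    """Two-group NN': r_i on the diagonal, l_ii within groups, l12 across."""
    n = v1 + v2
    rows = []
    for i in range(n):
        gi = 0 if i < v1 else 1
        row = []
        for j in range(n):
            gj = 0 if j < v1 else 1
            if i == j:
                row.append(r1 if gi == 0 else r2)
            elif gi == gj:
                row.append(l11 if gi == 0 else l22)
            else:
                row.append(l12)
        rows.append(row)
    return rows


def exact_det(matrix):
    """Determinant by Gaussian elimination over exact rationals."""
    a = [[Fraction(x) for x in row] for row in matrix]
    n, det = len(a), Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if a[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            a[c], a[pivot] = a[pivot], a[c]
            det = -det  # a row swap flips the sign
        det *= a[c][c]
        for r in range(c + 1, n):
            factor = a[r][c] / a[c][c]
            for k in range(c, n):
                a[r][k] -= factor * a[c][k]
    return det


def det_formula(v1, v2, r1, r2, l11, l22, l12):
    """Closed form for det(NN') in the two-group case."""
    return ((r2 - l22) ** (v2 - 1) * (r1 - l11) ** (v1 - 1)
            * ((r2 + l22 * (v2 - 1)) * (r1 + l11 * (v1 - 1))
               - v1 * v2 * l12 ** 2))


params = (2, 3, 4, 5, 1, 2, 1)  # arbitrary illustrative values
direct = exact_det(build_nnt(*params))
closed = det_formula(*params)
```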

v,-l 

Let  p£  - VXiilVl)»  pi“(ri’XiiJ  * *nd  Ri  * W*1,2,  In 

order to evaluate the Hasse-Minkowski invariant of NN' for the 2-group design we note that the leading principal minor determinants may be put in the form

$$D_i = (r_1 - \lambda_{11})^{i-1} (r_1 + \lambda_{11}(i-1)), \quad i = 1, 2, \ldots, v_1$$
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whara  ?2  m ^V*22^"l,v2 


datCZj)  - R|R2-Aj2. 


i 


Using the definition for the Hasse-Minkowski invariant we obtain


VVl 


Cp(MN»)  - (-l.-l)p  j«o  (D^.-typ 


-[(-l-l)p(01..D#)p(O2.-D1)p...(O,i>-D<i.l)l 

’ *"DVj^p’  * ’ ^°Vj-*V2  '"^♦Vj-l^p 

• yv  <VAV”Vv'°vvj)p‘ 

v2 

- (-».-DPcp(v(f,  •°;>[c-*.-v1(v-Di-^] 


The terms in the brackets are of a form similar to a Hasse-Minkowski invariant for Du. Noting this we write
v2 


Cp(NN')  - C-l.-OpCptAjJC^D’^Pj.D^p. 


If $v_1 + v_2 = b$ and if det(NN') > 0, then $NN' \sim I_{v_1+v_2}$ and we obtain


Theorem 3. Necessary conditions for the existence of a I-IBIBD with $v_1 + v_2 = b$ and det(NN') > 0 are that
and if so then


V2^»W*XU) 


VWD.  ,kpi-Vp  " *l 


for  all  primes  p. 


For  a design  with  parameters  v^>5,r^>ka6(  \n^22"^’^12"^* 

the initial parametric equations, (6) and (7), are satisfied, det(NN') ~ 1, but for p = 13 it may be shown that

s(Ai)ci(v(pi'Vp  * -1 

which implies that the design does not exist. It may be noted that this design is also a group divisible design. Utilizing Bose and Connor's [1] results for GD designs, the product is also -1, confirming our result.
ABSTRACT. Various uses of order statistics (OS), particularly in reliability studies and robust estimation, are first briefly reviewed. A more detailed treatment is then given of three further uses of OS, namely in data compression, selection procedures, and in some double sampling situations. Concomitants of OS are defined and applied to the last two areas. It is shown that considerable savings may be possible in the estimation of the mean of a random variable Y, which is expensive to measure, if a correlated random variable X can be cheaply determined. Tables are provided to allow immediate application of the techniques described.

1. INTRODUCTION. If the random variables $X_1, X_2, \ldots, X_n$ are rearranged in ascending order of magnitude and then written as

$$X_{(1)} \le X_{(2)} \le \cdots \le X_{(n)}$$

we call $X_{(r)}$ the order statistic of rank r or simply the r-th order statistic (r = 1, 2, ..., n). In this paper we concentrate on the commonly occurring case when the (unordered) $X_i$ (i = 1, 2, ..., n) are independent r.v.'s with common cumulative distribution function (c.d.f.) P(x). It then follows at once that

$$\Pr\{X_{(n)} \le x\} = \Pr\{\text{all } X_i \le x\} = P^n(x) \qquad (1)$$

and

$$\Pr\{X_{(1)} > x\} = \Pr\{\text{all } X_i > x\} = [1 - P(x)]^n. \qquad (2)$$

These results have interesting interpretations, for if $X_i$ is the lifetime of the i-th component in a parallel system of n like components, then $X_{(n)}$ is the lifetime of the last component to fail, i.e., $X_{(n)}$ is the lifetime of the system. Likewise $X_{(1)}$ is the lifetime of a series system. Thus, knowing the probability distribution of component lifetime, we can from (1) and (2) deduce the probability distribution of a parallel or series system consisting of n such components. Indeed, if the components are unlike and $X_i$ has c.d.f. $P_i(x)$, eqs. (1) and (2) are easily modified to
$$\Pr\{X_{(n)} \le x\} = P_1(x) P_2(x) \cdots P_n(x) \qquad (3)$$

$$\Pr\{X_{(1)} > x\} = [1 - P_1(x)] [1 - P_2(x)] \cdots [1 - P_n(x)]. \qquad (4)$$

There is an obvious connection with problems of reliability. If $R_i(x)$ is the probability that the i-th component is still functioning at time x, then $R_i(x) = 1 - P_i(x)$, and (4) gives the well-known result that the reliability of a series system is the product of the component reliabilities, it being assumed that the components fail independently.
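Equations (3) and (4) translate directly into system reliabilities at a fixed time x. The following minimal sketch assumes independent component failures, as the text does; the function names are illustrative.

```python
from math import prod


def series_reliability(component_reliabilities):
    """Eq. (4): a series system works only if every component works."""
    return prod(component_reliabilities)


def parallel_reliability(component_reliabilities):
    """From eq. (3): a parallel system fails only if every component fails."""
    return 1.0 - prod(1.0 - r for r in component_reliabilities)


r_components = [0.9, 0.8]
r_series = series_reliability(r_components)
r_parallel = parallel_reliability(r_components)
```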

Closely related are problems of life testing; for if n like items (e.g., light bulbs) are put on test simultaneously the life test will take time $X_{(n)}$ to completion. We may wish to terminate the test earlier, say as soon as the r-th item has failed (Type II censoring). The test then lasts time $X_{(r)}$ and we are led to study the behavior of $X_{(r)}$, especially its expected value $\mu_{r:n}$ and its variance $\sigma^2_{r:n}$. As a matter of fact, we will usually be able to observe the order statistics $X_{(1)}, X_{(2)}, \ldots, X_{(r)}$, the lifetimes of all failed items. If we can assume an appropriate distribution of lifetime, such as the normal $N(\mu, \sigma^2)$, then it is easy, with the help of tables, to construct linear functions of the order statistics (OS) $\sum_{i=1}^{r} a_i X_{(i)}$ and $\sum_{i=1}^{r} b_i X_{(i)}$ which are respectively estimators of $\mu$ and $\sigma$ having minimum variance in the class of linear unbiased functions of the OS.


These and many other applications of OS are treated in some detail in [10] and [4]. Among other applications we may single out the use of OS in the construction of distribution-free confidence intervals and tolerance intervals, the use of the range $(X_{(n)} - X_{(1)})$ as an estimator of scatter especially in quality control, probability plotting, tests for outliers, and extreme-value theory. In recent years there has been particular interest in finding robust estimators, i.e., estimators which are not too greatly affected by the presence of spurious observations or by our failure to assume the correct underlying distribution.

OS play a prominent role in such robust estimators since the more central observations in an ordered sample are less liable to be affected by both spurious observations and failure of assumptions than are the more extreme ones. Thus a very simple robust estimator which is unbiased for the mean of any symmetric population is the midmean

$$MM = \frac{2}{n} \sum_{i = n/4 + 1}^{3n/4} X_{(i)} \qquad (n \text{ a multiple of } 4)$$

which is generally more efficient (although not quite as robust) than the familiar sample median

$$M = X_{(\frac{1}{2}(n+1))} \qquad (n \text{ odd})$$

$$\;\;\; = \tfrac{1}{2} \left( X_{(\frac{1}{2}n)} + X_{(\frac{1}{2}n + 1)} \right) \qquad (n \text{ even}).$$

Both MM and M are examples of trimmed means; M is an extreme example, at the other extreme being the untrimmed mean $\bar{X}$, not a robust estimator. Many other types of robust estimators have been proposed, the more elaborate 'adapting' themselves to certain features of the sample in an attempt to produce an estimator particularly appropriate for that sample (see e.g. [1]).
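The midmean and the median are straightforward to compute once the sample is sorted. A minimal sketch follows, with illustrative function names; the midmean is defined here only for n a multiple of 4, as in the text.

```python
def midmean(xs):
    """Average of the central half of the ordered sample (n multiple of 4)."""
    n = len(xs)
    if n == 0 or n % 4 != 0:
        raise ValueError("n must be a positive multiple of 4")
    s = sorted(xs)
    # ranks n/4 + 1 .. 3n/4 correspond to zero-based indices n/4 .. 3n/4 - 1
    return 2.0 * sum(s[n // 4: 3 * n // 4]) / n


def median(xs):
    """Sample median for odd or even n."""
    s = sorted(xs)
    n = len(s)
    if n % 2 == 1:
        return s[(n + 1) // 2 - 1]
    return 0.5 * (s[n // 2 - 1] + s[n // 2])


clean = [1, 2, 3, 4, 5, 6, 7, 8]
contaminated = [1, 2, 3, 4, 5, 6, 7, 1000]  # one spurious observation
```

The spurious value 1000 moves the ordinary mean substantially but leaves both the midmean and the median unchanged.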

We  turn  now  to  a more  detailed  treatment  of  three  further  uses  of 
OS,  namely  in  data  compression,  selection  procedures,  and  in  some  double 
sampling  situations. 

2. DATA COMPRESSION. If the observations in a large random sample of n from a population of interest are arranged in ascending order, then it is possible to estimate the population mean (and other parameters) from a small number k of OS, and to do so with remarkably little loss of information if the OS are suitably chosen. For k = 2 the optimal estimator of the mean $\mu$ of a normal population turns out to be, from large-sample theory,

$$\mu^* = \tfrac{1}{2} \left( Z(0.2708) + Z(0.7292) \right),$$

where e.g. Z(0.2708) stands for the order statistic with rank equal to the integral part of 0.2708n + 1. Thus if n = 100

$$\mu^* = \tfrac{1}{2} \left( X_{(28)} + X_{(73)} \right).$$

The efficiency of $\mu^*$ is 0.81 (for any large n), so that its variance in samples of 100 is equal to the variance of the best estimator, the sample mean, in samples of 81. For k = 4 the optimal estimator is

$$\mu^* = .1918 \, [Z(.1068) + Z(.8932)] + .3082 \, [Z(.3512) + Z(.6488)],$$

with efficiency 0.92. It should also be noted that $\mu^*$ is much more robust than the sample mean, since it does not involve the more extreme OS. Table 1 gives $\mu^*$ for k = 2(2)12.
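The rank rule and the two estimators above can be written out as follows. This is a sketch with illustrative function names; the fractions and weights are the ones quoted in the text, with the k = 4 spacings taken to be symmetric about 1/2.

```python
def order_statistic_at(sorted_xs, p):
    """Z(p): order statistic with rank equal to the integral part of p*n + 1."""
    rank = int(p * len(sorted_xs) + 1)
    return sorted_xs[rank - 1]  # ranks are 1-based


def mean_from_two(xs):
    """k = 2 estimator: average of Z(0.2708) and Z(0.7292)."""
    s = sorted(xs)
    return 0.5 * (order_statistic_at(s, 0.2708) + order_statistic_at(s, 0.7292))


def mean_from_four(xs):
    """k = 4 estimator with weights .1918 and .3082."""
    s = sorted(xs)
    outer = order_statistic_at(s, 0.1068) + order_statistic_at(s, 0.8932)
    inner = order_statistic_at(s, 0.3512) + order_statistic_at(s, 0.6488)
    return 0.1918 * outer + 0.3082 * inner


sample = list(range(1, 101))  # n = 100, so X_(r) = r
est2 = mean_from_two(sample)
est4 = mean_from_four(sample)
```

With n = 100 the k = 2 estimator uses ranks 28 and 73, matching the worked example in the text.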

An interesting application of $\mu^*$ and related estimators has been made in space flights [5]. A large sample of, say, particle counts taken on a space craft may be replaced by enough OS to allow (a) satisfactory estimation on the ground of parameters of interest, such as the mean count, and (b) a test of the assumed underlying distributional form, by means of probability plotting.

3. SELECTION PROCEDURES. Suppose we wish to select the top k scorers in a certain test taken by n individuals (k < n). How much better than average do we expect the selected group to be? More precisely, we are really interested in the 'selection differential'

$$\Delta(k, n) = E[D(k, n)], \qquad (5)$$

where D(k, n) is the average scaled difference between the selected OS and the mean score $\mu$, viz.

$$D(k, n) = \frac{1}{k} \sum_{i=n+1-k}^{n} \left( \frac{X_{(i)} - \mu}{\sigma} \right), \qquad (6)$$

where $\sigma$ is the s.d. of the test score X which for definiteness we take to be normally distributed. $\Delta(k, n)$ is readily evaluated with the help of the important Table 2 giving the expected value $\xi(i|n)$ of the i-th largest order statistic from a standardized normal distribution, viz.

$$\xi(i|n) = E\left( \frac{X_{(n+1-i)} - \mu}{\sigma} \right). \qquad (7)$$

Example 1. $\Delta(1, 20) = \xi(1|20) = 1.867$

$\Delta(5, 20) = \tfrac{1}{5} (1.867 + 1.408 + 1.131 + 0.921 + 0.745) = 1.214.$

If $\mu$ = 100, $\sigma$ = 16, typical values for IQ tests, we see that the expected score of the best in 20 is

$$E X_{(20)} = \mu + \sigma \, \xi(1|20) \quad \text{by (7)}$$

$$\;\;\; = 100 + 16(1.867) = 130$$

and the expected average score of the five best is

100 + 16(1.214) = 119.4.
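Example 1 can be reproduced from the five tabulated values of $\xi(i|20)$ quoted in the text. The sketch below hard-codes only those five values; the function names are illustrative.

```python
# xi(i|20) for i = 1..5, as quoted in Example 1
XI_20 = [1.867, 1.408, 1.131, 0.921, 0.745]


def selection_differential(k, xi):
    """Delta(k, n): mean of the k largest expected standardized scores."""
    return sum(xi[:k]) / k


def expected_top_mean(mu, sigma, k, xi):
    """Expected average raw score of the k best, mu + sigma * Delta(k, n)."""
    return mu + sigma * selection_differential(k, xi)


best_of_20 = expected_top_mean(100, 16, 1, XI_20)
best_five = expected_top_mean(100, 16, 5, XI_20)
```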

Very extensive tables of the expected values of OS from normal, exponential, Weibull, and gamma distributions are provided in [6]. A useful approximation to $\Delta$, covering also non-normal distributions, is given in [2].

Sometimes  we  may  also  be  interested  in  the  variance  of  D(k.  n). 

From  (6)  it  is  clear  (see  [l • j for  explicit  results)  that  this  can  be 
found  from  tables  of  variances  and  covariances  of  OS: 


$$\sigma_{rr':n} = \operatorname{cov}\left( \frac{X_{(r)} - \mu}{\sigma}, \; \frac{X_{(r')} - \mu}{\sigma} \right) \qquad (8)$$

(r = 1, 2, ..., n; r' = 1, 2, ..., n; r = r' gives variance).

The  means,  variances,  and  covariances  of  OS  in  samples  of  n < 20  have 
been  tabulated  not  only  for  the  standard  normal  distribution  ([10],  p. 
200  or  [9],  Table  10)  but  for  a variety  of  other  distributions  which 
depend  only  on  a location  and  a scale  parameter  (for  a listing  see  [1,. 
p.  22.6).  For  larger  samples  approximations  are  available. 


Often we are interested in how individuals selected because of their good scores on X may be expected to score on Y, a r.v. to be measured in a later test. We shall assume that Y is linearly related to X except for an independent error term Z:

$$Y_i = \mu_Y + \rho \frac{\sigma_Y}{\sigma_X} (X_i - \mu_X) + Z_i, \quad i = 1, 2, \ldots, n \qquad (9)$$

where $\mu_X = E(X)$, $\mu_Y = E(Y)$, $\beta = \rho \sigma_Y / \sigma_X$, and $\rho$ is the correlation coefficient between X and Y which have respective standard deviations $\sigma_X$ and $\sigma_Y$; without essential loss of generality we take $\rho > 0$. From (9) it follows that $\mu_Z = 0$ and $\sigma_Z^2 = (1 - \rho^2) \sigma_Y^2$. An important special case of (9) occurs when X and Y are bivariate normal (when Z must also be normal).

Now if we order the X's, eq. (9) gives

$$Y_{[r]} = \mu_Y + \rho \frac{\sigma_Y}{\sigma_X} (X_{(r)} - \mu_X) + Z_{[r]}, \quad r = 1, 2, \ldots, n, \qquad (10)$$

where $Y_{[r]}$ and $Z_{[r]}$ denote the r.v.'s Y and Z associated with $X_{(r)}$. Because of the mutual independence of all n X's and n Z's in (9), ordering of the X's does not affect the distribution of the Z's, so that the $Z_{[r]}$ are, like the $Z_i$, n independent r.v.'s with mean 0 and variance $\sigma_Z^2$. The $Y_{[r]}$ are the r.v.'s of interest and we call $Y_{[r]}$ the concomitant of the r-th order statistic.


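On taking expectations in (10) we have

$$E(Y_{[r]}) = \mu_Y + \rho \, \sigma_Y \, E\left( \frac{X_{(r)} - \mu_X}{\sigma_X} \right)$$

or

$$E\left( \frac{Y_{[r]} - \mu_Y}{\sigma_Y} \right) = \rho \, E\left( \frac{X_{(r)} - \mu_X}{\sigma_X} \right). \qquad (11)$$

This result may be described by saying that for the r.v. $Y_{[r]}$ the selection differential of $X_{(r)}$ is attenuated by the factor $\rho$.

From (10) we have also

$$\operatorname{var} Y_{[r]} = \sigma_Y^2 \left( \rho^2 \sigma_{rr:n} + 1 - \rho^2 \right) \qquad (12)$$

and for s = 1, 2, ..., n ($s \ne r$)

$$\operatorname{cov}(Y_{[r]}, Y_{[s]}) = \sigma_Y^2 \, \rho^2 \, \sigma_{rs:n}. \qquad (13)$$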


4. DOUBLE SAMPLING. We are all too frequently faced with the problem of estimating a population mean, $\mu_Y$ say, from samples smaller than we would like because of the high cost of observing Y which may, for example, involve destructive testing. Suppose n items are available to us and we are prepared to measure Y on k of them (k < n). Now if it is possible to measure cheaply for each of the n items a quantity X, correlated with Y, then such auxiliary measurements can be used to improve the estimation of $\mu_Y$. We shall assume that X and Y have a bivariate normal distribution (possibly after suitable transformations) although the method below is applicable to the more general model (9).


Instead of the mean of k randomly chosen observations on Y we propose the following estimator:

$$\bar{Y}_{[k]} = \frac{1}{k} \sum_{i=1}^{k} Y_{[r_i]},$$

where $Y_{[r_i]}$ is the concomitant of $X_{(r_i)}$, the r.v. of rank $r_i$ among the X's. Table 3 gives the values of $r_1, r_2, \ldots, r_k$ which minimize the variance (obtainable through (12) and (13)) of the unbiased estimator $\bar{Y}_{[k]}$ for various n and k. Our double sampling procedure is therefore as follows:

(i) Arrange the n measurements on X in ascending order of magnitude.
(ii) Then measure Y on those k items having X-ranks $r_1, r_2, \ldots, r_k$.
(iii) Take the average of these k Y-values to obtain $\bar{Y}_{[k]}$.
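Steps (i) to (iii) can be sketched as a short routine. In this sketch `measure_y` is a hypothetical callback standing in for the expensive measurement of Y on a chosen item, and the ranks passed in the example are placeholders for illustration; in practice they would be read from Table 3.

```python
def concomitant_mean(xs, ranks, measure_y):
    """Average Y over the items whose X-ranks are listed in `ranks`.

    xs        : cheap X measurements, one per item
    ranks     : 1-based X-ranks r_1, ..., r_k (placeholders here)
    measure_y : hypothetical callable, item index -> measured Y
    """
    # (i) order the items by their X value
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    # (ii) measure Y only on the items holding the requested ranks
    ys = [measure_y(order[r - 1]) for r in ranks]
    # (iii) average the k measured Y-values
    return sum(ys) / len(ys)


x_values = [5.0, 1.0, 3.0, 4.0, 2.0]
y_lookup = [50.0, 10.0, 30.0, 40.0, 20.0]  # stands in for a costly test
estimate = concomitant_mean(x_values, [2, 4], lambda i: y_lookup[i])
```

Only the ranks of the X's enter the routine; their numerical values are used solely to establish the ordering.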

Note that we actually need to know only the ranks of the X's to find $\bar{Y}_{[k]}$. If the numerical values of the X's are available, then it is also possible to use regression estimates with randomly chosen Y's [3] or, better still, with selected concomitants, but it turns out [7] that the simple $\bar{Y}_{[k]}$ is generally quite efficient. Table 4 gives the variance of $\bar{Y}_{[k]}$ as a function of $|\rho|$ for n = 19 and 49 and k = 4 and 10. For $\rho = 0$, $\bar{Y}_{[k]}$ is equivalent to $\bar{Y}_k$. Entries for $|\rho| > 0$ therefore indicate the reduction in variance due to the use of the auxiliary variables.
Note: For $i > \tfrac{1}{2} n$ use $\xi(i|n) = -\xi(n+1-i|n)$ (from [8], Table 29).


Table 2. Optimal estimators of the mean of a normal distribution from k selected order statistics
0. ABSTRACT. In the design of a new system, or the maintenance of an old system, allocation of resources is of prime consideration. In allocating resources it is often beneficial to develop a solution that yields an optimal value of the system measure of desirability. In the context of the problems considered in this paper the resources to be allocated are components already produced (assembly problems) and money (allocation in the construction or repair of systems). The measure of desirability for system assembly will usually be maximizing the expected number of systems that perform satisfactorily and the measure in the allocation context will be maximizing the system reliability.

I. INTRODUCTION. This work on the optimal resource allocation for maximizing system reliability represents a summary of research in this area conducted by Cyrus Derman, Sheldon Ross and Gerald J. Lieberman. The basic problem is to allocate resources in a way that yields an optimal value of the system measure of desirability. Specifically the problems considered can be categorized as shown in the following table.


Resources to be                                      Measure of
Allocated             Problem                        Desirability

Money                 Allocation of funds            Maximizing the
                      in the construction            system reliability
                      or repair of systems

Components            Assembly of systems            Maximizing the
already                                              expected number of
produced                                             systems that perform
                                                     satisfactorily

II. ASSEMBLY PROBLEMS.

II.1. General Formulation. Resources consist of a stockpile of components, and these components are to be arranged in some fashion into a set of working systems. This problem was treated by Derman, Lieberman, and Ross in two papers, "On Optimal Assembly of Systems", NRLQ, Vol. 19, No. 4, December 1972, and "Assembly of Systems Having Maximum Reliability", NRLQ, Vol. 21, No. 1, March 1974. In particular, assume that a single system has m different types of components. Associated with each component is a numerical value. Let $(b^i)$, i = 1, 2, ..., m, denote this set of numerical values in the m components. Let $R(b^1, b^2, \ldots, b^m)$ denote the probability that the system will perform satisfactorily, i.e., $R(b^1, \ldots, b^m)$ is the reliability of the system. For example, let $b^i$ denote the probability that the i-th component will work when component performances are independent. If all components must work then the reliability is just $R = b^1 \cdot b^2 \cdots b^m$. Nevertheless, this formulation allows for the component performances to be dependent. Now suppose that there are n units of each component with corresponding $b^i_1, b^i_2, \ldots, b^i_n$ for every i. The problem considered is to arrange the nm units into n systems, to maximize the expected number of systems that perform satisfactorily, i.e., maximize E(N), where N is the number of systems that work. Of course this criterion is equivalent to maximizing the sum of the n reliabilities.


II.2. "Series" Results. If R is a distribution function
(includes a series system of independent components) and if
$b^i_1 < b^i_2 < \cdots < b^i_n$ for $i = 1, 2, \ldots, m$, then the n
systems represented by the partitions

    $(b^1_1, b^2_1, \ldots, b^m_1), \ldots, (b^1_n, b^2_n, \ldots, b^m_n)$

is the optimal arrangement, i.e., put the "worst" together, the second
"worst" together, ..., and finally, the "best" together. Furthermore,
if $R(b^1, \ldots, b^m) > 1/2$ for every permutation of the units, then
this same arrangement also minimizes the variance of the number of
systems that perform satisfactorily. Finally, if

    $R(b^1, b^2, \ldots, b^m) = b^1 b^2 \cdots b^m$,

where

    $b^i = P(i\text{th component works})$,

then this same arrangement maximizes

    $P(N > r)$

for each r.
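
As a hedged illustration of the series result, the following Python sketch checks by brute force, on a small made-up stockpile, that the sorted ("worst with worst") arrangement attains the largest expected number of working systems when R is the product of independent component probabilities. The function names and the numerical values are illustrative assumptions and are not taken from the cited papers.

```python
from itertools import permutations, product
from math import prod


def expected_working(systems):
    """E(N) for series systems of independent units: sum of the products."""
    return sum(prod(system) for system in systems)


def sorted_assembly(units_by_type):
    """Put the worst units together, the second worst together, and so on."""
    return list(zip(*(sorted(units) for units in units_by_type)))


def brute_force_best(units_by_type):
    """Largest E(N) over every way of matching units of the m types."""
    first, rest = units_by_type[0], units_by_type[1:]
    best = 0.0
    # Fixing the order of the first type loses nothing: only the grouping matters.
    for perms in product(*(permutations(units) for units in rest)):
        best = max(best, expected_working(zip(first, *perms)))
    return best


# Hypothetical stockpile: m = 3 component types, n = 3 units of each type.
units = [[0.9, 0.5, 0.7], [0.6, 0.95, 0.8], [0.99, 0.4, 0.85]]
print(expected_working(sorted_assembly(units)), brute_force_best(units))
```

The exhaustive search grows as (n!)^(m-1), so it is only meant as a check on tiny instances; the sorted arrangement itself needs nothing more than m sorts.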
II.5. Results for Parallel Systems. The solution to this problem,
i.e., the arrangement that maximizes E(N), attempts to make the
reliabilities of each system as equal as possible. Indeed, if a partition
exists that makes the reliabilities equal, it is optimal. Unfortunately
such an arrangement may not exist. However, bounds are available so
that the maximum expected number of systems that perform satisfactorily
will be within these bounds; the bounds being a function of an arbitrarily
chosen partition. Finally, an improvement algorithm is also available.
Essentially, this algorithm looks for pairwise interchanges of units
which make two systems have "more equal" reliabilities. Incidentally,
all the results obtained for this problem carry over to the original
problem where each system is required to contain exactly m components.
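
The pairwise-interchange idea can be sketched as follows. This is only an illustration under stated assumptions, not the algorithm of the cited papers: each system is taken to be a parallel system of independent units (it works if at least one unit works), and a swap of two units between two systems is kept whenever it lowers the sum of the two failure probabilities. Because a swap leaves the product of the two failure probabilities unchanged, a smaller sum means the two reliabilities have become "more equal". All names and numbers are hypothetical.

```python
from math import prod


def failure_prob(system):
    """Probability that every unit of a parallel system fails."""
    return prod(1.0 - p for p in system)


def expected_working(systems):
    """Expected number of parallel systems that work."""
    return sum(1.0 - failure_prob(system) for system in systems)


def pairwise_interchange(systems, tol=1e-12):
    """Swap units between pairs of systems for as long as a swap raises E(N)."""
    systems = [list(system) for system in systems]
    improved = True
    while improved:
        improved = False
        for a in range(len(systems)):
            for b in range(a + 1, len(systems)):
                for i in range(len(systems[a])):
                    for j in range(len(systems[b])):
                        before = failure_prob(systems[a]) + failure_prob(systems[b])
                        systems[a][i], systems[b][j] = systems[b][j], systems[a][i]
                        after = failure_prob(systems[a]) + failure_prob(systems[b])
                        if after < before - tol:
                            improved = True  # keep the improving swap
                        else:
                            # undo a swap that does not help
                            systems[a][i], systems[b][j] = systems[b][j], systems[a][i]
    return systems


# Hypothetical starting partition: two systems of two units each.
start = [[0.9, 0.8], [0.3, 0.2]]
print(expected_working(start), expected_working(pairwise_interchange(start)))
```

Such a local search stops at an arrangement that no single swap can improve, which need not be the global optimum.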


II.6. Another Application of Assembly of Systems Model. A version
of the target assignment problem can be related to the general parallel
system assembly formulation. Manne's "A Target Assignment Model",
Operations Research, Vol. 6, No. 3, 1958, treats essentially the
following target assignment problem. There are t weapons to be assigned
against n targets. Let $p_{ij}$ be the probability that the $i$th weapon
will destroy the $j$th target if it alone is assigned to it. The objective
is to minimize the expected number of surviving targets. If $x_{ij}$
denotes the probability that the $i$th weapon is assigned to the $j$th
target, then the $x_{ij}$ are sought that minimize

    $\sum_{j=1}^{n} \prod_{i=1}^{t} (1 - p_{ij} x_{ij})$,

subject to

    $\sum_{j=1}^{n} x_{ij} = 1$,    $i = 1, 2, \ldots, t$.
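
To make the objective concrete, the sketch below evaluates the expected number of surviving targets, the sum over targets j of the product over weapons i of (1 - p_ij x_ij), and enumerates all assignments in which each x_ij is 0 or 1. Restricting x to 0 or 1 is an assumption made here for illustration (the formulation above allows probabilities), and the kill probabilities are made up.

```python
from itertools import product
from math import prod


def expected_survivors(p, x):
    """Sum over targets j of the product over weapons i of (1 - p[i][j] * x[i][j])."""
    t, n = len(p), len(p[0])
    return sum(prod(1.0 - p[i][j] * x[i][j] for i in range(t)) for j in range(n))


def best_pure_assignment(p):
    """Enumerate every way of sending each weapon to exactly one target."""
    t, n = len(p), len(p[0])
    best_value, best_targets = float("inf"), None
    for targets in product(range(n), repeat=t):
        x = [[1.0 if targets[i] == j else 0.0 for j in range(n)] for i in range(t)]
        value = expected_survivors(p, x)
        if value < best_value:
            best_value, best_targets = value, targets
    return best_value, best_targets


# Hypothetical kill probabilities: 2 weapons (rows) against 2 targets (columns).
p = [[0.5, 0.2], [0.3, 0.6]]
print(best_pure_assignment(p))
```

Enumeration visits n^t assignments, so it serves only to illustrate the objective on very small cases.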
II.7. Results for Target Assignment Problem. Manne points out
that this is a nonlinear problem and an exact solution is not known.
However, by making some simplifying assumptions he presents two
approximate solutions (one due to himself and one due to G. B. Dantzig).
The analogous concepts in the assembly model version would assume that
$x_{ij}$ is zero or one. The $i$th weapon corresponds to the $i$th unit. The
$j$th target corresponds to the $j$th system. Whereas $p_{ij}$ depends on
both the weapon and target, the probability of a unit working in the
assembly context is assumed to be independent of which system it is
placed in and hence is denoted by $p_i$. This would imply that the $i$th
weapon has the same probability of destroying each target. Under
this assumption (which is less stringent than those proposed by Manne)
the system assembly results are relevant.

II.8. Other Work on System Assembly. An independent and earlier
discussion of the assembly problem with other applications can be
found in Abe, "Multi-Stage Rearrangement Problem and its Application
to Multiple System Reliability", Journal of the Operations Research
Society of Japan, Vol. 11, No. 1, November 1968. He uses somewhat
different techniques, particularly in the parallel case. In the
reliability context he always assumes independence of components,
and his version of the parallel system problem requires each system
to contain m units. However, for this case he obtains some sufficient
conditions for optimality weaker than equal reliabilities. He
also points out that the assembly model can be used in search and
assignment contexts.

III.  ALLOCATION  PROBLEMS. 

III.1. General Allocation Problem. Let A denote a fixed
amount of money to be used to build a single system consisting of
n components. Define $P_i(x_i)$ as the probability that component i
will work if $x_i$ is allocated to its production. The problem is to
choose $x_1, x_2, \ldots, x_n$ so as to maximize the probability that the
system works, i.e.,

    maximize    $R[P_1(x_1), P_2(x_2), \ldots, P_n(x_n)]$

    subject to  $\sum_i x_i = A$,

where R is the probability that the system performs satisfactorily.

III.2. Special Cases of General Model.

A. System can be represented by n independent modular subsystems
connected in parallel and/or series: in such cases R has an identifiable
simple form. Bodin, in his paper, "Optimization Procedures for the
Analysis of Coherent Structures", IBM Data Processing Division Report
No. 320-3909, July 1968, has done some work on this problem. He
developed some algorithms, but essentially, the solution is still unknown.

B. In order to get some insight into this general problem, a
simple version is considered by Derman, Ross, and Lieberman in a
series of papers, (i) "Assembly of Systems Having Maximum Reliability",
Naval Research Logistics Quarterly, Vol. 21, No. 1, March 1974, (ii)
"Optimal Allocations in the Construction of k out of n Reliability
Systems", Management Science, Vol. 21, No. 3, November 1974, and (iii)
"A Stochastic Sequential Allocation Model", Technical Report No. 16C>,
Stanford University, September lt«, 1974. These papers assume that
$P_i(x) = P(x)$ for all components, and the system has a special
structure, i.e., it is a k out of n system. However, another
facet is added, namely, in some of our models allocation decisions
can be made sequentially.

III.3. An Allocation of Money Resources Model. Suppose A
denotes a fixed amount of money to build a single system consisting
of n components. Define P(x) as the probability that a component
will work if amount x is allocated to its production.

A. Non-Sequential Version: Choose $x_1, x_2, \ldots, x_n$ in
order to maximize $R(P(x_1), \ldots, P(x_n))$, i.e., the probability
that the system works.

B. Sequential Version: $x_1$ is allocated to produce the
first component. Using the information as to whether the first
allocation produced a working or non-working component, $x_2$ is
then allocated to produce a 2nd component. We proceed in this
manner, making no more than n allocations. The problem is to
choose $x_1$, $x_2$, up to $x_n$ if necessary, sequentially to maximize
the probability that the system will work.

It is assumed that an n component system will work if at
least k of the components function.


III.4. Results for an Allocation of Money Resources Model.

(i) k = 1 (parallel system) - sequential or non-sequential version.
If log(1 - P(x)) is convex, then the x's are chosen so that

    $x_1 = x_2 = \cdots = x_n = A/n$.

If log(1 - P(x)) is concave, then the x's are chosen so that

    $x_1 = A, \quad x_2 = \cdots = x_n = 0$.
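
A small numerical check of the two cases in (i) may help. The sketch below searches a grid of two-way splits of A for a parallel system of n = 2 components. The two return functions are illustrative choices, not taken from the papers: P(x) = 1 - exp(-sqrt(x)) makes log(1 - P(x)) = -sqrt(x) convex, and P(x) = x makes log(1 - x) concave.

```python
import math


def parallel_reliability(alloc, P):
    """Probability that at least one component works (k = 1)."""
    return 1.0 - math.prod(1.0 - P(x) for x in alloc)


def best_two_way_split(A, P, steps=1000):
    """Grid search over x_1 in [0, A] with x_2 = A - x_1; returns the best x_1."""
    def value(i):
        x = A * i / steps
        return parallel_reliability((x, max(A - x, 0.0)), P)
    return A * max(range(steps + 1), key=value) / steps


# Convex log(1 - P): the best split found is the equal one, x_1 = A/2.
print(best_two_way_split(1.0, lambda x: 1.0 - math.exp(-math.sqrt(x))))

# Concave log(1 - P), with A < 1 so that P(x) = x stays a probability:
# the best split found puts the whole budget into a single component.
print(best_two_way_split(0.8, lambda x: x))
```

In the concave case the two extreme splits (0, A) and (A, 0) are equivalent by symmetry, so the search may return either one.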


(ii) General k (note k = n is series system) - sequential or
non-sequential version.

If log(1 - P(x)) is (strictly) convex then if one wants to
sequentially build k working components in at most n attempts,
n > k, then it is (uniquely) optimal to allocate A/n at each stage
when A is the total resource available. Thus, it also follows
that the same allocation is optimal for the non-sequential model.
(iii) Special case of P(x) = x - sequential case - k = 1 and 2.
Since log(1 - x) is a concave function, the results presented under
(i) hold for k = 1, i.e., $x_1 = A$, $x_2 = 0$, ..., $x_n = 0$. Exact
results can also be obtained for the case of k = 2. The optimal
policy $\pi^*$ can be described as follows. When the present amount
available is y and at most n additional components can be built,
then

a) if only one additional working component is needed, $\pi^*$ allocates
min(y, 1), and

b) if two additional working components are needed, $\pi^*$ allocates


(iv) Special case of P(x) = x - sequential case - general k.

For the general case (any k), it is conjectured that the optimal
policy $\pi^{**}$ is such that when the present amount available is y
and if k additional working components are needed with at most n
stages to go, then $\pi^{**}$ calls for allocating

If  y 2-5TT  (k-’> 

(v) Special case of P(x) = x - non-sequential case.
The optimal allocation $x^* = (x^*_1, x^*_2, \ldots, x^*_n)$ is such that all
of the non-zero elements of $x^*$ are equal. It is not clear how
many non-zero elements are present in an optimal allocation,
although some indications are available for A near k or zero;
for A near k the number of non-zero elements is small while
for A near zero the number is large.
III.5. A Stochastic Sequential Allocation Model (SSAM). The
following model is described in terms of an investment problem,
although several other interpretations are available for this model.

We have D units available for investment. During each of N time
periods an opportunity to invest will occur with probability p. As
soon as an opportunity presents itself we must decide how much of
our available resources to invest. If we invest x, then we obtain
an expected profit P(x), where P is a nondecreasing continuous
function. The amount x then becomes unavailable for future
investment. The problem is to decide how much to invest at each
opportunity so as to maximize the total expected profit.

III.6. Other Applications of SSAM.

A. Target Assignment Application of SSAM. Suppose that there
are D units of ammunition available, and for each of N time units,
say days, the target may be under attack. During each of the N
days enemy planes will attack with probability p. As soon as planes
appear, we must decide how much of our ammunition to expend. If x
units of ammunition are expended then P(x) is the expected number
of enemy planes that will be downed.

B. Allocation of Research Effort Application of SSAM. A
proposal is received and sent out for review. From past history
the fraction of those receiving favorable reviews is p (p may be
thought of as the probability of the referee recommending funding).
However, the review comes in as recommending approval or rejection.
If the review is positive, how much should be allocated to each
proposal? We have a total of D dollars available. If x is
allocated, then P(x) is the return of the investment. We have
N proposals to be sent for review and decisions must be made
sequentially. Another interpretation is for p to represent the
probability of a favorable report being received in each of N
given days.
III.7. Results for SSAM Model. When P(x) is convex, it is
easily shown that the optimal policy is to invest everything when
an opportunity presents itself. When P(x) = log x, and if there
are n time periods to go and A dollars available then the optimal
amount to invest at this time, $x_n(A)$, is given by

    $x_n(A) = \dfrac{A}{1 + (n-1)p}$.

Another case where the optimal policy can be made explicit is when

    $P(x) = x^{\alpha}$,  $0 < \alpha < 1$.
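
The logarithmic case can be checked numerically. The sketch below is a grid dynamic program written for illustration only; it takes the closed form for P(x) = log x to be x_n(A) = A / (1 + (n-1)p) and compares it with the amount the recursion selects. It assumes 0 < p < 1, that an opportunity is at hand with n periods to go (the current one included), and that a positive amount must be invested at every opportunity since log 0 is minus infinity. The function name, grid size, and test values are assumptions.

```python
import math


def optimal_first_investment(n, p, total=1.0, grid=400):
    """Amount invested now, with n periods to go and an opportunity at hand."""
    h = total / grid
    log_amt = [float("-inf")] + [math.log(i * h) for i in range(1, grid + 1)]
    # w[j]: expected future profit with j grid units left, before it is known
    # whether an opportunity occurs; starts at zero periods to go.
    w = [0.0] * (grid + 1)
    best_i = grid
    for _ in range(n):
        v = [float("-inf")] * (grid + 1)  # value with an opportunity at hand
        arg = [0] * (grid + 1)
        for j in range(1, grid + 1):
            for i in range(1, j + 1):  # invest i units, keep j - i
                val = log_amt[i] + w[j - i]
                if val > v[j]:
                    v[j], arg[j] = val, i
        best_i = arg[grid]
        # An opportunity occurs with probability p; otherwise a period is lost.
        w = [p * v[j] + (1 - p) * w[j] for j in range(grid + 1)]
    return best_i * h


for n in (1, 2, 3):
    print(n, optimal_first_investment(n, 0.5), 1.0 / (1 + (n - 1) * 0.5))
```

Agreement is only up to the grid spacing, because the objective is flat near its maximum.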

III.8. Further Results for SSAM Model. When P(x) is a general
concave function, it is only possible to describe the structure of the
optimal policy. In particular, if V(n,A) denotes the supremal
expected additional profit attainable when there are n time periods
to go, A dollars available, and an investment opportunity is at hand,
and $x_n(A)$ the optimal amount to invest at this time, then

(i) V(n,A) is a concave function of A,

(ii) $x_n(A)$ is a nondecreasing function of A, and

(iii) $x_n(A)$ is a nonincreasing function of n.

This structure can be used to simplify the necessary computations,
but does not yield a closed form expression for the optimal value to
invest.


III.9. The Sequential Stochastic Assignment Problem. It is
assumed that there are n men available to perform n jobs. The
jobs arrive in sequential order with each job being categorized
before a man is assigned to it. It is assumed that the category $c_j$
of the $j$th job is determined by a probability measure over all possible
categories and that $\{c_j\}$ ($j = 1, \ldots, n$) are independent with the
same probability measure. The $i$th man has a value $x_i$ ($0 < x_i < 1$,
$i = 1, \ldots, n$) associated with him. If the $i$th man is assigned to
the $j$th job the (expected) return is a known function $P(x_i, c_j)$.

After a man is assigned to a job, he is unavailable for future
assignments. The objective is to assign the workers sequentially
to maximize total expected return. This problem was treated by
Derman, Lieberman, and Ross in "A Sequential Stochastic Assignment
Problem", Management Science, Vol. 18, No. 7, March 1972.

III.10. Relationship of SSAM to the Sequential Stochastic
Assignment Problem. In the stochastic sequential allocation model
the possible categories are two in number. The first category,
which occurs with probability 1-p, corresponds to $P(x, c_1) = 0$
(no occurrence of an opportunity); the second, which occurs with
probability p, to $P(x, c_2) = P(x)$ (occurrence of an opportunity).

The n men each having a value $x_i$ ($i = 1, \ldots, n$) is equivalent
to a predetermined division of the total resources $\sum x_i = D$ and
the problem is simply that of assigning the predetermined values.

The allocation problem requires instead of a sequential assignment
of values a sequential division of the resources. Beyond occurring
or not occurring, the present allocation model does not permit a
more refined weighing of the return function since P(x) is assumed
to be the same for each occurrence.

IV. CONCLUSIONS. Are the results relevant for solving the general
allocation problem? Can they be used to aid in the design of a new
system or in the maintenance of an old system? Obtaining an explicit
solution to the general allocation problem requires intimate knowledge
of cost functions and system performance. Similarly, this information
also appears to be necessary for obtaining explicit solutions to the
"simplified" models considered in this paper, with one important
difference: most solutions lead to qualitative results.
Admittedly, the "optimal solution" to the general allocation problem
is still open, but the results presented in this paper are useful in
enhancing "engineering intuition" for the purpose of getting "good"
answers to a most difficult problem.


Finally, the models presented, usually in a reliability context,
are quite broad so that they are useful in other areas, e.g., the
assembly of parallel systems and the stochastic sequential allocation
model are related to target assignment problems.




SIMPLE  STATISTICAL  ALTERNATIVES  TO  THE  METHOD  OF  LEAST  SQUARES  FOR  THE 
DETERMINATION  OF  X- INTERCEPT  AND  SLOPE 

Joseph  F.  Hanniyan  and  Mary  L.  Powers 
Research  Institute 

U.  S.  Army  Engineer  Topographic  Laboratories 
Fort  Belvoir,  Virginia 


ABSTRACT. It is standard procedure to use the method of least squares
to  obtain  the  best  straight  line  fit  to  a given  set  of  data  samples  which 
are  known  or  believed  to  represent  some  physical  quantity  having  a linear 
characteristic.  There  are  two  reasons  for  seeking  simple  alternatives  to 
the  method  of  least  squares.  The  first  is  that  some  field  applications 
require  only  a determination  of  the  X-intercept.  The  method  of  least 
squares  determines  the  slope  and  Y-intercept  from  which  the  X-intercept  is 
then  calculated.  The  second  reason  is  that  in  some  field  applications  only 
limited  amounts  of  data  can  be  retained,  stored,  or  manipulated  because  of 
restrictions  on  the  ADP  systems.  This  paper  examines  several  alternatives 
for  reducing  the  amount  of  data  used  and  a method  of  determining  the  X- 
intercept.  Of  course,  the  slope  appears  in  the  calculation.  However,  it 
is  not  determined  explicitly  and  the  Y-intercept  is  not  determined  at  all. 

For this investigation, a set of 100 data samples is used. These data
samples were obtained from a random number generator, then normalized and
applied to the "Y" coordinate appropriate to 100 "X" coordinates for the
line y = -x + 50. Results are evaluated in terms of closeness to the
original X-intercept (i.e., x = 50).

1.  INTRODUCTION.  The  purpose  of  this  paper  is  to  present  the  results 
of  an  investigation  which  explored  a statistical  alternative  and  a statistical 
variation  on  the  method  of  least  squares.  In  addition,  the  effect  of  simply 
reducing  the  number  of  data  samples  was  examined.  These  latter  results  are 
presented  in  Appendix  A simply  to  show  the  effect  of  loss  of  data.  The  loss 
could  be  deliberate  to  reduce  the  amount  of  data  for  simplicity  or  it  could 
be  accidental  due  to  equipment  malfunction. 

The  method  of  least  squares  has  many  applications  from  massive  batch 
reductions,  such  as  simultaneous  adjustments  of  large  geodetic  networks, 
to  the  relatively  simple,  straight  forward  determination  of  the  "best" 
straight  line  fit  for  data  which  are  known  or  believed  to  represent  a 
physical quantity having a linear characteristic. "Best" fit in the
latter case means the equation for the straight line which comes closer to
all the data points than any other straight line. In other words, it
usually means the line for which the sum of the squares of the deviations
between the given ordinate values of the data points and the corresponding
ordinates of the line is a minimum. This is the specific application
being considered in this investigation.


There  are  two  reasons  for  seeking  alternatives  to  the  method  of  least 
squares. The first is that some field applications require only a
determination of the X-intercept. The method of least squares determines the
slope  and  Y-intercept  from  which  the  X-intercept  is  calculated.  The  second 
reason  is  that  in  some  field  applications  only  limited  amounts  of  data  can 
be  stored,  retained,  and  manipulated  because  of  restrictions  on  the  size 
and  capacity  of  the  ADP  systems  being  used. 

A specific application to illustrate the above is the potential use of
an FM discriminator to determine a particular frequency. The discriminator
(here used in the generic sense) could then be used in the field as a cheap
replacement or a backup for the more expensive frequency counter. A
discriminator has a well known linear frequency versus voltage characteristic.

This  is  apparent  in  data  and  figures  presented  in  Appendix  B.  A disadvantage 
of  the  discriminator  is  that  it  does  not  have  the  resolution  of  the  more 
expensive  frequency  counter.  However,  a discriminator  can  be  sampled  every 
few milliseconds compared to the usual second or more for a frequency
counter.  The  sampling  rate  for  a discriminator  is  limited  only  by  the  settling 
time  of  the  sampling  and  data  storage  circuits  while  a frequency  counter  must 
count  for  a specific  time  interval  in  order  to  obtain  the  desired  resolution. 
Compared  to  a frequency  counter  a discriminator  can  provide  a very  large 
number  of  data  samples.  The  method  of  least  squares  is  recognized  as  a 
good,  standard  procedure  to  use  to  determine  the  best  straight  line  to  fit 
for  such  data.  A specific  frequency  or  frequencies  can  then  be  determined 
from  the  equation  of  the  line  so  obtained. 

An experiment was conducted at the Research Institute, U. S. Army
Engineer Topographic Laboratories, Fort Belvoir, VA, to determine the
feasibility of using discriminators for the purpose of determining a specific
frequency (e.g., 300.000 kilocycles). Details of the experiment are planned
for  inclusion  in  a separate  USAETL  report.  As  background  information  for 
this  paper,  Appendix  B contains  a block  diagram  of  the  experiment,  tables 
of  representative  data,  and  plots  of  the  data. 

2.  INVESTIGATION.  The  approach  taken  in  this  investigation  was 
analytical  and  empirical  rather  than  theoretical.  The  investigation  used 
both  experimental  data  (shown  in  Appendix  B)  and  hypothetical  data. 

The hypothetical data was based on the line Y = mX + b where the slope
m was chosen to be -1 and the X-intercept was chosen to be +50. A hundred
data samples (X, Y values) were obtained using equally spaced abscissa values
from 0 to 99 and calculating the corresponding ordinate values. These
ordinate values were then changed by the application of "normalized" random
numbers. The distribution is shown in Figure 1. These numbers were obtained
from a random number generator and then "normalized." The "normalized"
random numbers are shown in Table 1. One hundred data samples are not
enough to show a good normal distribution, hence the quotes on normalized.
However, this is the number of samples which would be expected in a specific
application and it is more than actually desired from the data storage and
manipulation standpoint. In a real case, even though the physical quantity
would be expected to have a normal distribution under long term conditions,
such distribution might not actually appear in a limited number of samples.
Hence, it is believed that the values used are reasonable for what would be
obtained in a real situation.

It seems appropriate at this point to illustrate briefly what is
involved in determining the X-intercept by the method of least squares.

DATA REQUIRED

    $X$,  $Y$,  $XY$,  $X^2$

    $\sum X$,  $\sum Y$,  $\sum XY$,  $\sum X^2$

NORMAL EQUATIONS

    $\sum Y = m \sum X + bN$
    $\sum XY = m \sum X^2 + b \sum X$

where m is the slope, b is the Y-intercept, and N is the number
of data samples. The slope and Y-intercept are obtained by the
simultaneous solution of the NORMAL EQUATIONS. The X-intercept is then
determined by the equation:

    $X = -\dfrac{b}{m}$


It  is  significant  to  note  that  as  the  data  samples  (X,  Y)  are 
put  into  the  ADP  system,  they  are  summed;  they  are  multiplied  and 
then  summed;  and  the  X values  are  squared  and  then  summed.  These 
operations  require  either  a certain  amount  of  memory  to  store  data 
while  the  arithmetical  operations  are  performed  or  a parallel  ADP 
capability  to  perform  the  multiplication,  squaring  and  summing 
operations  simultaneously  as  the  data  samples  are  put  into  the  ADP 
system. 
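
A minimal Python sketch of this procedure is given here for illustration. It accumulates the four running sums as samples arrive, so no sample needs to be stored, and then solves the two normal equations for m and b before forming the X-intercept as -b/m. The function name is an assumption; the example points are the five averages of 20 consecutive samples from Table 2, with the second abscissa taken as 29.5 (the midpoint of samples 20 to 39), which is an assumption about a hard-to-read entry.

```python
def least_squares_x_intercept(samples):
    """Return (x_intercept, slope) from the normal equations, using running sums."""
    n = sum_x = sum_y = sum_xy = sum_xx = 0.0
    for x, y in samples:  # each sample is used once and then discarded
        n += 1
        sum_x += x
        sum_y += y
        sum_xy += x * y
        sum_xx += x * x
    det = n * sum_xx - sum_x ** 2
    m = (n * sum_xy - sum_x * sum_y) / det       # slope
    b = (sum_y * sum_xx - sum_x * sum_xy) / det  # Y-intercept
    return -b / m, m


# Averages of each of 20 consecutive samples (Table 2); 29.5 is assumed.
TABLE2_AVERAGES_OF_20 = [
    (9.5, 40.42934),
    (29.5, 20.46093),
    (49.5, 0.61302),
    (69.5, -19.47466),
    (89.5, -39.52864),
]

x_intercept, slope = least_squares_x_intercept(TABLE2_AVERAGES_OF_20)
print(round(x_intercept, 5), round(slope, 5))
```

With these inputs the result agrees with the X-intercept and slope listed for that column of Table 2 to about four decimal places.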


The next thing to note is that the NORMAL EQUATIONS must be
solved simultaneously for the slope (m) and the Y-intercept (b).
From these values the X-intercept is then calculated from a third
equation. Physically, the Y-intercept has no significance in our
experiment; however, the slope gives the rate of change of the
frequency. The slope could be used to extrapolate the data in the event
of a malfunction which prevents getting enough data samples to include
the desired frequency. Although it is not expected to give a high
degree of accuracy, extrapolation might be better than no answer at all.
Although the slope is of secondary interest, it is included in the findings
since it does have some physical significance.

3.  FINDINGS  AND  RESULTS.  The  investigation  was  performed  with  the 
hypothetical  data  first  and  then  the  experimental  data.  The  impressive 
finding  was  the  extremely  good  results  obtained  by  simple  statistical 
averaging  of  consecutive  data  samples  to  form  new  sets  of  data.  This 
gave  a reduced  number  of  new  data  samples  which  reduces  the  amount  of 
storage  required.  It  even  eliminates  the  method  of  least  squares  when 
carried  to  the  extreme.  For  example,  by  dividing  the  data  into  halves, 
then  averaging  each  half  to  obtain  two  average  X coordinates  and  two 
average Y coordinates, the X-intercept can be obtained directly from the
linear equation:

    $X = \dfrac{X_2 - X_1}{Y_2 - Y_1}\,(-Y_1) + X_1$



The  above  equation  is  derived  from  the  equation  of  a straight  line 
between  two  points.  Note  that  the  X-intercept  is  obtained  immediately 
and  that  the  slope,  although  present,  is  not  derived  explicitly. 
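
The half-averaging procedure can be sketched in a few lines of Python. The function names are assumptions made for illustration; the formula is the two-point line equation evaluated at Y = 0, and the sample data in the usage lines are noise-free points on y = -x + 50 rather than the paper's randomized data.

```python
def x_intercept_from_two_points(x1, y1, x2, y2):
    """X-intercept of the straight line through (x1, y1) and (x2, y2)."""
    return (x2 - x1) / (y2 - y1) * (-y1) + x1


def half_average_x_intercept(samples):
    """Average each half of the samples, then use the two-point formula."""
    half = len(samples) // 2
    first, second = samples[:half], samples[half:]
    x1 = sum(x for x, _ in first) / len(first)
    y1 = sum(y for _, y in first) / len(first)
    x2 = sum(x for x, _ in second) / len(second)
    y2 = sum(y for _, y in second) / len(second)
    return x_intercept_from_two_points(x1, y1, x2, y2)


# Noise-free samples on the line y = -x + 50 for x = 0, 1, ..., 99.
samples = [(float(x), -float(x) + 50.0) for x in range(100)]
print(half_average_x_intercept(samples))
```

Only four running sums and two counts are needed, and the slope is never formed as a separate quantity, which is the storage advantage the text describes.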

The  results  obtained  by  reducing  the  data  by  simple  statistical 
averaging of consecutive data samples are shown in Table 2 for the
hypothetical  data  and  Tables  3 and  4 for  the  experimental  test  data. 

The  X-intercept  and  slope  obtained  by  the  method  of  least  squares  using 
all  the  data  samples  are  shown  in  Table  5 for  comparison  purposes. 

4.  CONCLUSION.  Based  on  the  empirical  and  analytical  approach  taken 
in  this  investigation,  it  is  concluded  that  simple  statistical  averaging  of 
the  data  samples  so  as  to  obtain  two  average  data  samples  (i.e.,  one 
average  sample  for  each  half  of  the  data)  and  the  application  of  the 
single  equation  for  the  straight  line  between  two  points  provides  an 
alternative  to  the  method  of  least  squares  for  the  determination  of  the 
X-intercept  and  slope.  The  accuracy  obtainable  with  this  procedure  is 
believed  to  be  acceptable  for  many  applications,  especially  in  view  of  the 
simplicity  of  the  calculations. 

It  is  also  concluded  that  statistical  averaging  of  consecutive  data 
samples  provides  a variation  to  the  method  of  least  squares  which  has  an 
accuracy that would be acceptable in many applications. The averaging of
consecutive data samples could be used to reduce the data storage
requirements or possibly permit parallel arithmetical operations.

It  is  recognized  that  these  conclusions  are  based  on  limited  amounts 
of data. However, intuition tends to support the logic of these
conclusions and the numerical results tend to verify them.
TABLE 2. HYPOTHETICAL DATA

                                                      STRAIGHT LINE BETWEEN
          LEAST SQUARES SOLUTION USING                TWO END POINTS
                                                      SOLUTION USING

Averages of Each of 10    Averages of Each of 20     Averages of Each Half
Consecutive Samples       Consecutive Samples        (End Points of Line)

    X          Y              X          Y               X          Y
   4.5     45.58589          9.5     40.42934          24.5     25.51836
  14.5     35.27179         29.5     20.46093          74.5    -24.51836
  24.5     25.87178         49.5       .61302
  34.5     15.05009         69.5    -19.47466
  44.5      5.81243         89.5    -39.52864
  54.5     -4.58638
  64.5    -14.43884
  74.5    -24.51048
  84.5    -34.61079
  94.5    -44.44648

X-Intercept   Slope       X-Intercept   Slope        X-Intercept   Slope
 49.99992   -1.00015       50.00037    -.99926        49.99963   -1.00073

TABLE 3. TEST SET NO. 1

                                                      STRAIGHT LINE BETWEEN
          LEAST SQUARES SOLUTION USING                TWO END POINTS
                                                      SOLUTION USING

Averages of Each of 5     Averages of Each of 10     Averages of Each Half
Consecutive Samples       Consecutive Samples        (End Points of Line)

Frequency   Voltage       Frequency   Voltage        Frequency   Voltage
 299.910     37.6          299.923     32.0          299.9475     21.5
 299.935     26.4          299.972     11.0          300.0475    -21.95
 299.960     16.0          300.022    -10.9
 299.985      6.0          300.072    -33.0
 300.010     -5.8
 300.035    -16.0
 300.060    -27.0
 300.085    -39.0

X-Intercept   Slope       X-Intercept   Slope        X-Intercept   Slope
 299.99698  -433.80938     299.99698  -433.79977      299.99698  -434.50000
TABLE 4. TEST SET NO. 2

                                   STRAIGHT LINE BETWEEN
LEAST SQUARES SOLUTION USING       TWO END POINTS SOLUTION USING

Averages of Each of 5              Averages of Each Half
Consecutive Samples                (End Points of Line)

Frequency   Voltage                    X            Y

 299.920      33.6                 299.94500     21.90000
 299.970      10.2                 300.04500    -16.70000
 300.020      -5.6
 300.070     -27.8

X-Intercept   Slope                X-Intercept    Slope

 300.00150  -400.00002             300.00174   -386.00000

TABLE 5

Least Squares Solution Using All Data Samples

                        X-Intercept       Slope

Hypothetical Data         49.99901      -1.00039
Test Set Number 1        299.99691    -434.63343
Test Set Number 2        300.00117    -405.58409



APPENDIX A

LEAST SQUARES SOLUTION
HYPOTHETICAL DATA + RANDOM NUMBERS

        X                X-INTERCEPT      SLOPE        N

0, 1, 2, ..., 99          49.99981      -1.00039      100
0, 2, 4, ..., 98          50.04974       -.99684       50
1, 3, 5, ..., 99          49.95376      -1.00389       50
0, 3, 6, ..., 99          51.48230      -1.09439       33
0, 5, 10, ..., 95         50.15983      -1.00213       20
0, 10, 20, ..., 90        49.85273       -.99125       10

Least Squares Solution
Hypothetical Data + Normalized Random Numbers
X = 0, 10, 20, ..., 90 Omitted
N = 99

Omit X     X-Intercept      Slope

   0        50.00038      -1.00042
  10        50.01663      -1.00121
  20        50.00284      -1.00050
  30        50.00769      -1.00058
  40        49.99136      -1.00029
  50        50.01514      -1.00038
  60        49.98716      -1.00055
  70        49.99263      -1.00056
  80        49.98575      -1.00090
  90        50.01784      -0.99953

Least Squares Solution
Hypothetical Data and Normalized Random Numbers
Randomly Selected Point Omitted
N = 99

Omit X     X-Intercept      Slope

  56        50.00985      -1.00031
  46        49.98573      -1.00033
  29        49.98723      -1.00008
  14        50.00161      -1.00047
  12        50.00397      -1.00058
  97        50.00648      -1.00002
  17        50.01115      -1.00084
   4        50.00923      -1.00092
  79        50.00599      -1.00017
  60        49.98716      -1.00055
LEAST SQUARES SOLUTION
HYPOTHETICAL DATA + RANDOM NUMBERS
OMIT SERIES OF FIVE VALUES
N = 95

Omit X     X-Intercept      Slope

  0-4       49.99346       -.99998
 10-14      50.01240      -1.00101
 20-24      50.00259      -1.00049
 30-34      50.03673      -1.00119
 40-44      49.98093      -1.00025
 50-54      50.00376      -1.00040
 60-64      49.96250      -1.00093
 70-74      49.98099      -1.00090
 80-84      50.02216       -.99949
 90-94      49.97216      -1.00188

Least Squares Solution of Sets of Sequential Points
Hypothetical Data and Normalized Random Numbers

   X       X-Intercept      Slope        N

  0-30      50.30710       -.99269      31
 39-59      50.09299      -1.03749      21
 34-64      50.15715       -.98349      31
 44-54      50.07964      -1.12965      11
 69-99      49.78529       -.99515      31

Table 1.  Frequency (5 cycle interval) and voltage data obtained with
a Phase Lock Loop Discriminator.


299.900  + 42        300.000  -  1
299.905  + 40        300.005  -  4
299.910  + 37        300.010  -  6
299.915  + 36        300.015  -  8
299.920  + 33        300.020  - 10
299.925  + 31        300.025  - 12
299.930  + 29        300.030  - 14
299.935  + 26        300.035  - 16
299.940  + 24        300.040  - 18
299.945  + 22        300.045  - 20
299.950  + 20        300.050  - 22
299.955  + 18        300.055  - 25
299.960  + 16        300.060  - 27
299.965  + 14        300.065  - 29
299.970  + 12        300.070  - 32
299.975  + 10        300.075  - 35
299.980  +  8        300.080  - 37
299.985  +  6        300.085  - 39
299.990  +  4        300.090  - 41
299.995  +  2        300.095  - 43
                     300.100  - 46
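The averaged values reported for Test Set No. 1 can be checked against this table. The sketch below assumes, from the arithmetic alone, that the first 40 readings (299.900 through 300.095) were used and that the final 300.100 reading was left out; the variable names are illustrative.

```python
# Voltage readings from Table 1 for 299.900 through 300.095 in steps of 0.005.
# Dropping the last (300.100) reading is an assumption; it gives two equal halves.
voltages = [
    42, 40, 37, 36, 33, 31, 29, 26, 24, 22,
    20, 18, 16, 14, 12, 10, 8, 6, 4, 2,
    -1, -4, -6, -8, -10, -12, -14, -16, -18, -20,
    -22, -25, -27, -29, -32, -35, -37, -39, -41, -43,
]
freqs = [299.900 + 0.005 * i for i in range(len(voltages))]


def average(values):
    """Arithmetic mean of a non-empty sequence."""
    return sum(values) / len(values)


# Averages of each run of 5 consecutive voltage samples.
five_sample_averages = [average(voltages[i:i + 5]) for i in range(0, 40, 5)]

# Straight line through the averages of each half (the two end points of the line).
half = len(voltages) // 2
f1, v1 = average(freqs[:half]), average(voltages[:half])
f2, v2 = average(freqs[half:]), average(voltages[half:])
slope = (v2 - v1) / (f2 - f1)
x_intercept = f1 - v1 / slope  # frequency at which the fitted voltage is zero

print([round(v, 1) for v in five_sample_averages])
print(round(v1, 2), round(v2, 2), round(slope, 2), round(x_intercept, 5))
```

Under these assumptions the half averages come out at 21.5 and -21.95 volts, which agrees with the end points tabulated for Test Set No. 1.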

Table  2.  Frequency  (10  cycle  Interval)  and  voltage  data  obtained  with 
a Phase  Lock  Loop  Discriminator. 


299.900  + 42 
299.910  + 39 
299.920  + 34 
299.930  + 29 
299.940  + 24 
299.950  + 20 
299.960  + 15 
299.970  + 10 
299.980  + 5 
299.990  + 1 
300.000  + 0 
300.010  - 0 
300.020  - 4 
300.030  - 10
300.040  - 14 
300.050  - 19 
300.060  - 23 
300.070  - 28 
300.080  - 32 
300.090  - 37 
300.100  - 42 


Table 3.  Frequency (100 cycle interval) and voltage data obtained with
a Pulse Type Discriminator.


499.000  5.068
499.100  5.069
499.200  5.070
499.300  5.071
499.400  5.072
499.500  5.073
499.600  5.074
499.700  5.075
499.800  5.076
499.900  5.077
500.000  5.078
500.100  5.079
500.200  5.080
500.300  5.081
500.400  5.082
500.500  5.083
500.600  5.084
500.700  5.085
500.800  5.086
500.900  5.087
501.000  5.088


Plot of data from Table 2 showing linear response of Discriminator measurement with rare
occurrence of an apparent shift in response characteristic in the vicinity of zero-doppler.


Plot of data from Table 3 showing perfect linear response of Discriminator to input frequency.
ABSTRACT. It is becoming increasingly important to examine available
data on structural behavior in the assessment of design criteria. To this
end a rationale for examination and selection of loading criteria based upon
available data is proposed. The rationale is based upon examination of both
structural load data and structural failure data. This paper examines in
some detail the assessment of structural failure data and extends some of the
same ideas to structural load data. In order to extract information content
from collected data, a class of statistical methods applicable to the data
has been selected. A matrix correlating data parameters with statistical
method is developed. A method for assessing the overall information content
of the collected data is proposed. Finally, recommendations are made for
future collection and correlation of load and failure data.

1. INTRODUCTION. A fundamental problem of structural engineering is
the examination and selection of loading criteria. It is imperative that
any solution to the problem center around a rationale that relates
information available on loading to selected criteria. Such available information
is generally in the form of data. It is the purpose of this brief paper to
abstract the problem and outline preliminary work on a rationale for
addressing the problem.

The paper begins by defining the general nature of the problem.
Solution to the problem is related to consideration of available information
in the form of data. The next three sections of the paper discuss the
initial stages of a rationale for consistent examination and selection of
loading criteria. The first of the sections examines available information
on structural load and the second examines available information on instances
of structural failure. Classes of statistical methods are discussed in the
third section. This section also includes discussion of a proposed method
for assessing the overall information content of the available data. Finally,
an illustrative example of application of a statistical method to
loading data is presented and the paper concludes with a discussion of future
extension to this preliminary work.


* "The  views  of  the  author  do  not  purport  to  reflect  the 
position  of  the  Department  of  the  Army  or  the  Department  of  Defense." 
2. GENERAL NATURE OF THE PROBLEM. Examination and selection of loading
criteria involves the consideration of a statement S with quantifiers that
relate variables useful in description of the load. This may be written
generally as

$$ S:\quad C = C(\bar{t}, \bar{r}, \bar{a}) \tag{1} $$

for

S = loading criteria statement
C = load function
$\bar{t}$ = time vector
$\bar{r}$ = space vector
$\bar{a}$ = parameter vector

The statement S has as quantifier the load function C, which is expressible
in terms of time, space and a finite set of parameters. The expression is
general enough to allow for several components of time, space and parameters
as denoted by the vector notation. The problem may now be stated in terms of
examination of the validity of S.

Validity of S is usually established through some subjective and
objective evaluation of available information related to S. In order to
be consistent in this evaluation of information, a rationale for carrying
out this evaluation must be set forth. The preliminary outline of the
rationale proposed in this paper centers upon a means of assessing
available information related to S by use of statistical techniques,
correlating this information and obtaining quantitative factors upon which
the validity or invalidity of S may be established. In a real sense this
rationale in part already exists in that statistical interpretation of
collected data is commonplace in examination of load data. The
discussion to follow extends this rationale. However, it is important to
note here that "all" available information is to be examined in evaluating
the validity or invalidity of S. This includes consideration of load
information for one. In addition, since the invalidity of S tacitly implies
possibility of structural failure because of load, one must also consider
structural failure information. It is the general nature of these sets of
information that provides the basis for this preliminary work on development
of the rationale.

3. LOAD INFORMATION. Load information is obtained in a quantity termed
a datum. Such datum may be in a raw form or in a summary form. The raw form
consists of the most basic unit and results from direct quantization of the
phenomenon under observation. The summary form results from a transformation
of the raw datum.

Upon collection of data on loading, e.g., wind loading, it becomes apparent
that some way of classifying individual pieces of datum needs to be
developed. Once classified, groups of datum within any one designated category
could be examined for consistency and their relationship to the proposed loading
criteria. The discussion to follow defines the datum classification system and
the example in Section 6 illustrates application of a statistical technique to
a piece of datum within the system.
The requirements on a load data classification system are very basic.
First, a single piece of datum must be recognized as such in the system
and second, a piece of datum must be classifiable within the system. In order
to facilitate this a "generalized random process", L, is defined whose "sample
functions" consist of pieces of datum described by a set of parameters
related to the load phenomenon¹*. This is most easily expressed as

$$ L(\bar{t}, \bar{r}, \bar{a}) = \{\, l(\bar{t}, \bar{r}, \bar{a}) : \bar{t} \in T,\ \bar{r} \in R,\ \bar{a} \in A \,\} \tag{2} $$

where

$L(\bar{t}, \bar{r}, \bar{a})$ = generalized random process
$l(\bar{t}, \bar{r}, \bar{a})$ = a piece of datum
$\bar{t}$ = time related description of the datum
T = time indexing set
$\bar{r}$ = space related description of the datum
R = space indexing set
$\bar{a}$ = parameter related description of the datum
A = parameter indexing set.

It is assumed that every piece of datum related to a load phenomenon belongs
to $L(\bar{t}, \bar{r}, \bar{a})$, and that each piece of datum is uniquely defined through an ordered
triple of vectors $(\bar{t}, \bar{r}, \bar{a})$.
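One way to picture this classification scheme is as a record keyed by the ordered triple, with datum sets formed by grouping on any part of the key. The sketch below is only an illustration of that idea: the class `Datum`, the helper `datum_sets` and the sample field values are hypothetical and do not come from the paper.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Datum:
    """One piece of datum, identified by an ordered triple of vectors (t, r, a)."""
    t: Tuple[str, ...]  # time related description
    r: Tuple[str, ...]  # space related description
    a: Tuple[str, ...]  # parameter related description


def datum_sets(data, key):
    """Group pieces of datum into datum sets that share the same key(datum)."""
    groups = defaultdict(list)
    for datum in data:
        groups[key(datum)].append(datum)
    return dict(groups)


# Hypothetical entries, used only to show the grouping.
data = [
    Datum(t=("time history",), r=("windward wall",), a=("wind", "raw")),
    Datum(t=("time history",), r=("roof",), a=("wind", "summary")),
    Datum(t=("spectrum",), r=("roof",), a=("snow", "summary")),
]
by_source = datum_sets(data, key=lambda d: d.a[0])
print({source: len(items) for source, items in by_source.items()})
```

Because the record is frozen and hashable, two pieces of datum with the same triple compare equal, which mirrors the assumption that the triple identifies a datum uniquely.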

The advantages of such a means of classifying data by evaluation of
$\bar{t}$, $\bar{r}$ and $\bar{a}$ are readily apparent. First, in evaluation of $\bar{t}$, $\bar{r}$ and $\bar{a}$, datum
sets are established within $L(\bar{t}, \bar{r}, \bar{a})$ that relate similar information about the
load phenomenon under investigation. Second, correlation of information
through evaluation of $\bar{t}$, $\bar{r}$ and $\bar{a}$ allows one to assess the overall information
content of the available data. Third, ready evaluation of data within a
given datum set is possible and links amongst datum sets provide a key to
links amongst data within different datum sets. Finally, this approach lends
itself well to either the synthesis approach or the analytical approach to
criteria selection. In the synthesis approach all data is structured into
a description of the load phenomenon and criteria are selected from this
description. In the analytical approach the datum is checked against the
proposed criteria for consistency and selection of criteria is based upon
this check. In either case the pertinent datum is easily identified.

A total of twenty-two parameters that must be evaluated for each piece
of datum are selected. The twenty-two parameters may be divided into five
groups. Brief mention of these five groups will suffice for the present
discussion. The first group, consisting of two parameters, uniquely identifies
the piece of datum. The second group, consisting of five parameters, identifies
the datum by defining the overall load phenomenon properties, e.g., static,
deterministic, stationarity, source, spatial extent. The third group,
consisting of eight parameters, describes the piece of datum in terms of the time
history information available. The fourth group, consisting of five parameters,
describes the datum in terms of the spectral information available. Finally,
the fifth group, consisting of two parameters, gives a brief narrative
description of the piece of datum along with a source reference.


* Elevated numbers denote references.

This general scheme of datum referencing permits a consistent examination
of structural load data and allows for easy construction of a structural load
data base.

4. STRUCTURAL FAILURE INFORMATION. The consideration of data on instances
of structural failure in the preliminary stages of development has provided for
some most interesting ideas on structuring of data from diverse and complex
phenomena. In the case of structural load data, the content of individual pieces
of datum is described in terms of a set of parameters and the raw or summary
form data is contained within each of the random process sample functions. Data
analysis was assumed to take place on a "level below" the datum structure,
$L(\bar{t}, \bar{r}, \bar{a})$. For the case of structural failure the nature of the available data
and information desired from the data requires that the description of the datum,
i.e., instance of structural failure, be complete enough for data analysis on
the datum structure level. That is, the analogous structural failure "generalized
random process" should contain all the available information concerning the
structural failure. This approach to structural failure datum is the product of
several considerations. First, quantitative structural failure data is difficult
to obtain since few instances of structural failure are instrumented. Second,
unless the failure is controlled in some manner, quantitative data tends to be
meaningless because of the complex load-response path that usually describes
the failure. Third, any one case of structural failure is but one of many
possible structural failures and it may or may not share properties in common
with other cases of structural failure. Fourth, detailed quantitative data
from instrumentation of a structural failure would present a prohibitively
high collection and reduction cost to information ratio. Finally, detailed
reduction of quantitative data obtained during or after structural failure
would tend to de-emphasize the overall characteristics of the structural failure.
Thus, structural failure data is considered in the following way.

It was hypothesized that structural failure may be considered a "generalized
random process"¹. Thus, it may be represented by an expression

$$ S(\bar{t}, \bar{r}, \bar{a}) = \{\, s(\bar{t}, \bar{r}, \bar{a}) : \bar{t} \in T,\ \bar{r} \in R,\ \bar{a} \in A \,\} \tag{3} $$

where

$S(\bar{t}, \bar{r}, \bar{a})$ = structural failure generalized random process
$s(\bar{t}, \bar{r}, \bar{a})$ = structural failure sample function
$\bar{t}$ = time vector
T = time indexing set
$\bar{r}$ = spatial vector
R = spatial indexing set
$\bar{a}$ = parameter vector
A = parameter indexing set.



All instances of structural failure belong to $S(\bar{t}, \bar{r}, \bar{a})$ and every failure is in
$S(\bar{t}, \bar{r}, \bar{a})$ either explicitly through collected data and parameter evaluation or
implicitly in cases where the structural failure is unrecorded but the indexing
sets are broad enough for the description. The problem of structural failure
data structuring now becomes a matter of defining T, R and A and evaluating
$\bar{t}$, $\bar{r}$ and $\bar{a}$ from collected data on structural failure.

Forty-five parameters are considered adequate to define the structural
failure random process, i.e., forty-five parameters are considered sufficient
to describe any instance of structural failure. Obviously, only the overall
gross characteristics of an instance of structural failure are considered
appropriate for description and most pertinent to the overall rationale.

The forty-five parameters fall into nine major categories. For the
sake of brevity these nine major categories will be listed with a few
comments regarding the parameters within each category.

1. identification - This category includes information on the source
and information content of the structural failure data available.

2. structure characteristic information - This category includes all
information related to the structure that experienced the failure.
The dates of construction and failure are recorded along with the
general structural, material and functional characteristics of the
structure. The geometrical dimensions of the structure along with
those of the failed portion of the structure are also recorded.

3. general failure description - This category describes the cause
of the failure, the extent of the failure both in qualitative
and quantitative terms, the nature of the failure in terms of
its possible progressive or nonprogressive characteristics,
horizontal or vertical characteristics, the total time of the failure
and the stages of the failure.

4. global failure description - For failures in which a major portion
of the overall structure has failed the failure takes on a global
nature. This is subsequently described by three parameters naming
elements of the structures that failed, modes of failure, and
material composing the failed elements of the structure.

5. local failure description - A failure of a structure may include a
small portion of the overall structure, in which case the failure
takes on a local nature. The same three parameters as for the
global failure description provide for the local failure description.

6. global load description - Loading on a structure that is over a
large portion of the structure may be termed a global load. It
is described in terms of four parameters including identification,
general dimensions, a general statement and estimated value if
this is available or able to be deduced.
7. local load description - Loading on a structure that is over a
small portion of the structure may be termed a local load. The
same four parameters as in the case of global load description
describe the local load.

8. load - failure relationship - In most instances of structural
failure there exists a general spatial relationship between load
and failure. This relationship may be expressed in terms of local
load - local failure, global load - local failure, local load -
global failure, global load - global failure. This parameter
provides insight into the nature of the extent of the loading
and the corresponding failure.

9. general statement - This final parameter group, consisting
of one parameter, is a general statement about the failure and
its cause.

Here again it is well to take note that structural failure does not relate
well to phenomenological description because of its complexity. The categories
of parameters and the parameters themselves provide for an overall view of the
structural failure process. Given data on structural failure the parameters
of $S(\bar{t}, \bar{r}, \bar{a})$ can be evaluated and $S(\bar{t}, \bar{r}, \bar{a})$ better defined. The statistical
techniques to be discussed in the next section are applied directly to the
parameters of $S(\bar{t}, \bar{r}, \bar{a})$.

5. BASIC CONSIDERATIONS FOR STATISTICAL ANALYSIS. The nature of the
problem under consideration and the number of statistical techniques
applicable to the problem make it possible to consider only a few topics in
relating statistical methods to the datum within the framework of the load
and failure generalized random processes discussed above.

One of the first considerations in applying statistical methods to data
defining the processes above is an examination of the way in which the data
is measured. There exist four acceptable statistical data measures by which
the measure of data is defined²,³. Listed in order from least to most
powerful they are as follows: nominal, ordinal, interval and ratio. A brief
description of each is in order. The nominal measure applied to data implies
the data may be categorized according to a set of mutually exclusive conditions.
The ordinal data measure applied to data implies there exists an order
relationship amongst pieces of the datum. The interval data measure applied to data
implies a relationship of the form

$$ x - y > 0, \quad x - y = 0 \quad \text{or} \quad x - y < 0 \tag{4} $$

exists between any two pieces of datum. Finally, the ratio data measure applied
to data implies numerical relationships for the datum are available and for
$y \neq 0$, $x/y$ is a meaningful expression between any two pieces of datum.
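The ordering of the four data measures can be made concrete with a small lookup. The sketch below is an illustration only: the names `Measure` and `meaningful_relations` and the wording of the relation labels are assumptions, and the rule that each measure retains the relations of the weaker measures below it is the usual convention for these scales rather than a statement taken from the paper.

```python
from enum import IntEnum


class Measure(IntEnum):
    """Statistical data measures, ordered from least to most powerful."""
    NOMINAL = 1
    ORDINAL = 2
    INTERVAL = 3
    RATIO = 4


# Relation that first becomes meaningful at each level (labels are illustrative).
NEW_RELATION = {
    Measure.NOMINAL: "membership in mutually exclusive categories",
    Measure.ORDINAL: "order relationship between x and y",
    Measure.INTERVAL: "sign of the difference x - y",
    Measure.RATIO: "ratio x / y for y != 0",
}


def meaningful_relations(measure):
    """Relations available at `measure`, including those of all weaker measures."""
    return [NEW_RELATION[level] for level in Measure if level <= measure]


for level in Measure:
    print(level.name, "->", meaningful_relations(level))
```

A lookup of this kind is enough to decide whether a given statistical method, which needs at least a certain measure, can be applied to a given parameter.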

Although there are a number of ways of dividing statistical methods into
categories, for purposes of this discussion perhaps the categories distribution
and distribution free will suffice. Distribution related statistical methods
in general correlate with instances in which distribution functions with a finite
number of parameters may be utilized in the statistical analysis of the
data. Distribution free related statistical methods in general
correlate with instances in which lesser restrictions are imposed upon
conditions that must be satisfied for application of the method to a given
set of data. These statistical methods may be further subdivided into
methods concerned with point estimates of parameters, confidence regions
for parameters or significance tests for parameters.

In the illustrative example to follow a distribution free statistical
method is applied to a piece of load datum. In general, distribution related
methods apply well to load data because of its tendency to be describable in
terms of the ratio data measure, and distribution free related statistical
methods apply well to structural failure data because of its tendency to be
describable in terms of data measures less powerful than the ratio measure.

In work to date emphasis has been placed on consideration of structural
failure data. It has become important to consider categorical distribution
free statistical techniques for use on parameters of the structural failure
random process. Categorical techniques are most applicable because structural
failure data is for the most part of a categorical nature. Distribution free
techniques are most applicable because of the difficulty in determining the
distributions and their related parameters resulting from unavailability of
large amounts of data.
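As an illustration of a categorical, distribution free technique of the kind discussed here, the sketch below computes an exact two-sided binomial test in plain Python. It is not taken from the paper: the function names are hypothetical, and the two-sided rule used (sum the probabilities of all outcomes no more likely than the observed one) is one common convention among several.

```python
from math import comb


def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)


def binomial_test_two_sided(k, n, p=0.5):
    """Exact two-sided p-value for observing k successes in n trials.

    Sums the probabilities of every outcome that is no more likely than
    the observed outcome under the hypothesized success probability p.
    """
    observed = binomial_pmf(k, n, p)
    tolerance = observed * 1e-9  # guards against floating point ties
    total = sum(
        prob
        for prob in (binomial_pmf(i, n, p) for i in range(n + 1))
        if prob <= observed + tolerance
    )
    return min(1.0, total)


# Hypothetical counts: 9 of 10 recorded failures fall in one of two categories.
print(binomial_test_two_sided(9, 10))
```

Only counts of category membership enter the calculation, so the test needs no more than the nominal data measure and no assumption about an underlying distribution of magnitudes.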

It is found useful when considering the structural failure generalized
random process to construct a statistical method - process parameter matrix
whereby statistical methods applicable to given process parameters are
correlated one to another. Table 1 below provides a segment of this matrix.


      SM   1 2 3 4 5
 FP
  6        XXX
  7        XX  X
  8        XX  X
  9

Key:

Failure Parameter (FP)          Statistical Method (SM)

6  descriptive name             1  binomial test
7  construction date            2  chi-square test for goodness of fit
8  failure date                 3  Wald - Wolfowitz run test
9  structural                   4  quantile test


Table 1: Statistical Method - Failure Process Parameter Matrix

The construction of the matrix in Table 1 leads naturally to an
assessment of the overall information content of a set of data based upon an
evaluation of factors useful in defining the overall characteristics of a
statistical method⁵. Table 2 lists factors useful in evaluating the effectiveness
of a statistical method along with proposed weights for these factors. The
overall information content of a set of data is determined by associating a
set of statistical techniques with the data and proceeding to tabulate weight
values for the various factors. A relative measure of information content
amongst sets of data is obtained.
Factor                                                      Weight

statistical data measure                                     (10)
    nominal                                                     2
    ordinal                                                     4
    interval                                                    6
    ratio                                                      10
sample size                                                    10
data transformation and restrictions on data parameters         2
level of computational effort                                   2
extent of use of symmetry                                       2
sensitivity of procedure to assumptions                         4
precision level                                              (10)
    exact                                                      10
    theoretical approximate                                     7
    judgment empirical                                          4
efficiency of method                                           10
consistency of method                                          10
sensitivity of procedure to assumptions and
    difficulty in verifying assumptions                        10
population properties and importance amongst
    other data groups                                           5


Table 2: Factors for Evaluating the Effectiveness of a Statistical
Method with Weights
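The tabulation of weight values can be sketched as a simple sum. The weights below are those of Table 2; everything else is an assumption made for illustration, in particular the rule that a method either earns the full weight of a factor or none of it, and the names `information_score` and `credited_factors`. The paper does not spell out its scoring rule.

```python
# Weights from Table 2. The two graded factors take one value each.
DATA_MEASURE_WEIGHT = {"nominal": 2, "ordinal": 4, "interval": 6, "ratio": 10}
PRECISION_WEIGHT = {"exact": 10, "theoretical approximate": 7, "judgment empirical": 4}

# Remaining factors, treated here (hypothetically) as all-or-nothing credits.
OTHER_WEIGHTS = {
    "sample size": 10,
    "data transformation and restrictions on data parameters": 2,
    "level of computational effort": 2,
    "extent of use of symmetry": 2,
    "sensitivity of procedure to assumptions": 4,
    "efficiency of method": 10,
    "consistency of method": 10,
    "sensitivity to and difficulty in verifying assumptions": 10,
    "population properties and importance amongst other data groups": 5,
}


def information_score(data_measure, precision, credited_factors):
    """Sum of Table 2 weights for one statistical method applied to one data set."""
    score = DATA_MEASURE_WEIGHT[data_measure] + PRECISION_WEIGHT[precision]
    score += sum(OTHER_WEIGHTS[name] for name in credited_factors)
    return score


# Hypothetical comparison of two methods on the same set of data.
print(information_score("nominal", "exact", ["sample size", "consistency of method"]))
print(information_score("ratio", "theoretical approximate", ["efficiency of method"]))
```

Scores computed this way are meaningful only relative to one another, in keeping with the relative measure of information content described in the text.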

There exist several major weaknesses in the approach. First, not all
statistical methods may be accurately evaluated in terms of these factors.
Second, it presumes that one has selected an optimal set of statistical
methods to operate on a given set of statistical data. Third, it presumes
that data information content is related to abstract measures on the
statistical method independent of the data. Finally, it assumes the weighting
factors are accurate and constant over the ranges of statistical methods.

Despite these weaknesses a matrix relating statistical method versus
weighting factor provides for a crude measure of the relative information content
of a set of data to which the statistical method may be applied.


6. AN ILLUSTRATIVE EXAMPLE. The example in this section of the paper
is illustrative in the sense that (a) it is not based upon all the data that
is available and (b) it presents a rather new approach in the reduction of
civil engineering data. The first point is a result of the preliminary nature
of this work and ability to reduce only a portion of the data available. The
second point refers to the use of distribution free statistical techniques on
the selected data. In general, measurement distribution oriented statistical
techniques are used on numerical data resulting from a well controlled
experiment. The results of the statistical analysis are then presented in some
concise form. Distribution free statistical techniques are oftentimes related
to a statistical hypothesis test that may or may not be associated with
parameters describing the data, e.g., trend or randomness of data may be under
investigation.
It is also well to point out that the conclusions drawn from the
illustrative example may seem trivial; however, each single conclusion
presents only a small piece of information [illegible] that which is
already known about the case under investigation. That is to say, the
effectiveness in use of techniques in this way comes by way of
[illegible] an overall view of the case by means of statistics. This implies
[illegible] statistical techniques in different ways to the data
available. Fortunately, once a data base has been constructed and
the statistical techniques selected, this becomes a rather simple
automatic procedure.

The illustrative example presented here concerns investigation of
the relationship between the set of values from collected data and the
set of values assigned by a criteria statement. In the context of the
previous discussion, Table 3 provides a statement of structural loading
criteria.

Criteria:  Following table for average pressure coefficients shall be
           used for calculating pressures on external surfaces of
           buildings.

Location of Wall                              C_pe

Windward wall
Leeward wall, both height-width
  and height-length ratios ≥ 2.5              -0.6
Other buildings
Side walls                                    -0.7

Table 3: Criteria for External Pressure Coefficients for Walls, C_pe

Some of the data related to this criteria acquired from wind tunnel
testing is provided in Table 4. In this table the external pressure
coefficient on a structure's wall is tabulated for two angles of incidence (0°
and 45°) to the building wall K, with the unprimed letters representing data
for building sides of 0° of incidence and the primed letters representing
data for building sides of 45° of incidence.


h:b:L        A     B     C     D     A'    B'    C'    D'

1:1:1       .9   -.5     ?     ?     ?     ?     ?     ?
2.5:2:5     .9   -.5   -.7   -.7    .6     ?     ?     ?
2.5:2:5     .9   -.5   -.7   -.7    .6     ?     ?     ?
2.5:2:5     .9   -.5   -.8   -.8    .6   -.5     ?     ?
1:4:4       .9   -.3   -.4   -.4    .5   -.4    .5     ?
1:8:16      .8   -.5   -.5   -.3    .5   -.3     ?     ?
2.5:1:1     .9   -.6   -.7   -.7    .5   -.5    .5     ?
2:1:2       .9   -.5   -.8     ?     ?     ?     ?     ?
1:2.4:12    .9   -.5   -.6     ?     ?     ?     ?     ?
1:1:5       .9   -.5   -.6   -.6    .5   -.8     ?     ?

Table 4: Structural Configuration and External Pressure Coefficient C_pe at
Angles of Incidence of 0° and 45°⁴
For this illustrative example it is to be determined if there is a
significant difference between the numbers representing the criteria and
those derived from the small amount of available wind tunnel data. This
is perhaps better stated by inquiring of some statistical measure of the
representation of the wind tunnel data by the criteria. However, it is
important to note here that the numbers of Table 3 and Table 4 do not
represent random sample values from a general population. For this
illustrative example it is assumed that there exist ten categories of
structural configuration defined by the h, b, L ratios of Table 4. In
addition it is assumed that initially a number of structures in each
category are designed on the basis of the data of Table 4. At a later date
the same number of structures in the respective categories are designed on
the basis of the data of Table 3. There exists then a population of designs
P1 associated with Table 3 and a population of designs P2 associated with
Table 4. Suppose then a random sample of designs is taken from P1 and P2,
being careful to select one and only one design from each category of the
two populations.

In effect then the tabulation of design values for Cpe from population P2
results in Table 4, and to each value of Table 4 corresponds a value of
Table 3. From this discussion, one may surmise that the example is quite
artificial; however, one must note that the objective of the consideration
is to determine in some statistical way the difference between a statement
of criteria and a small amount of data available for evaluating the
criteria. As a matter of fact the small amount of available data may have
been used in engendering the criteria.


The "statistical measure" for examining the difference in the artificially
constructed populations P1 and P2 is the Wilcoxon signed rank test. This
test is very likely not the most effective test that might be applied in
this instance; however, it is easy to apply and should yield some
information relative to the question being asked. The test assumes samples
of paired replicates with a model defined by

$$ Z_i = Y_i - X_i = \theta + e_i, \qquad i = 1, \ldots, n \tag{5} $$

where

$Y_i$ - sample values from population P2

$X_i$ - sample values from population P1

$\theta$ - unknown parameter of interest ("treatment" effect)
$e_i$ - unobservable mutually independent random variables from a
        continuous population symmetric about 0

The hypothesis to be tested is

$$ H_0: \theta = 0 \quad \text{against the alternative hypothesis} \quad H_1: \theta \neq 0 \tag{6} $$

If the hypothesis is accepted at a prescribed level, the criteria of Table 3
will be considered an adequate representation for the data of Table 4, and
if the hypothesis is rejected the criteria will be considered inadequate for
representation of the data. It should be cautioned that (1) a level of
significance for the test is somewhat arbitrary at this point and no
specific guidance is available for selection of a level that will provide a
solid confidence in acceptance or rejection of $H_0$, and (2) the data is not
complete. Utilizing the test statistic $T^+$ for small samples and $T^*$ for
large samples, where

$$ T^+ = \sum_{i=1}^{n} R_i \psi_i \tag{7} $$

$R_i$ = the rank of $|Z_i|$
$\psi_i$ = 1 if $Z_i > 0$, 0 if $Z_i < 0$
$n$ = the sample size
$i = 1, \ldots, n$

and

$$ T^* = \frac{T^+ - n(n+1)/4}{\sqrt{\left[ n(n+1)(2n+1) - \tfrac{1}{2} \sum_{j=1}^{g} t_j (t_j - 1)(t_j + 1) \right] / 24}} \tag{8} $$

$g$ = number of tied groups
$t_j$ = size of the tied group $j$.


Selecting the level of significance to be 0.01 and considering various
combinations of the data of Table 4 matched with the criteria of Table 3,
the results are tabulated in Table 5.


Criteria against      n      T* or T+      Decision

A                     9        45          reject H0
B                     1         1          no table values
C                     7        22          accept H0
D                     7        22          accept H0
A'                   10         0          reject H0
B'                    4         4          accept H0
C'                   10        55          reject H0
D'                   10        55          reject H0
ABCD                 24         3.34       reject H0
A'B'C'D'             34         2.64       reject H0

Table 5:  Tabulation of Statistics and Decision for the Wilcoxon Signed
          Rank Test for α = 0.01 (two tail test) for Criteria and Data on
          the External Pressure Coefficient.

It will be noted that the criteria are apparently an adequate description
for Cpe in three of the nine cases tested. Again some caution needs to
be exercised in drawing conclusions from the illustrative example, since
there is little guidance available on levels of test significance and the
amount of data is limited.
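
The statistics of equations (7) and (8) can be sketched in a few lines of
Python. This is only an illustration under stated assumptions: the function
name signed_rank and the sample values are hypothetical (they are not the
entries of Tables 3 and 4), zero differences are discarded before ranking,
and tied absolute differences receive the average of their ranks.

```python
from math import sqrt


def signed_rank(x, y):
    """Return (n, T_plus, T_star) for paired samples with Z_i = y_i - x_i.

    Hypothetical helper: zero differences are dropped and tied |Z_i|
    share the average rank, as assumed in the lead-in text.
    """
    # Round to suppress floating-point noise such as 0.9 - 0.8 != 0.1.
    z = [round(b - a, 10) for a, b in zip(x, y)]
    z = [d for d in z if d != 0]
    n = len(z)

    # Rank |Z_i| from smallest to largest, averaging ranks within ties.
    order = sorted(range(n), key=lambda i: abs(z[i]))
    ranks = [0.0] * n
    tie_sizes = []
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(z[order[j + 1]]) == abs(z[order[i]]):
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        tie_sizes.append(j - i + 1)
        i = j + 1

    # Equation (7): sum of the ranks attached to positive differences.
    t_plus = sum(r for r, d in zip(ranks, z) if d > 0)

    # Equation (8): large-sample statistic with the tie correction.
    variance = (n * (n + 1) * (2 * n + 1)
                - 0.5 * sum(t * (t - 1) * (t + 1) for t in tie_sizes)) / 24
    t_star = (t_plus - n * (n + 1) / 4) / sqrt(variance) if variance > 0 else float("nan")
    return n, t_plus, t_star


# Hypothetical data: a criterion value of 0.8 against ten measured values.
criteria = [0.8] * 10
measured = [0.9] * 9 + [0.8]
print(signed_rank(criteria, measured))
```

With these hypothetical inputs one pair is discarded as a zero difference,
and the nine remaining positive differences are all tied, so each receives
rank 5.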
7.  CONCLUSION.  The above represents a very preliminary basis for a
statistical examination of the load and failure of structures and a rational
approach to examining available information related to loading criteria.
The next stage in the development will consider construction of a data base
of available data along with establishing a broader group of statistical
techniques. This should lead to the consideration of mathematical pattern
language in the correlation of collected data and in the utilization of
appropriate statistical techniques on the collected data. In addition it is
anticipated that more advanced mathematical techniques, e.g., in the area of
combinatorial methods, will be used for investigating general relationships
amongst the diverse pieces of data.

The ideas expressed above form a basis for a rationale for the examination
and selection of load criteria. The rationale is based upon a consistent and
thorough statistical analysis of available load data and failure data. Given
the statement S representative of a statement of load criteria, the validity
of S is deduced from the consistent and thorough statistical analysis of all
available data. Work to date described above is a first step in the
development of the rationale.
STRAIN GAGE INSTRUMENTATION FOR AMMUNITION TESTING


Paul D. Flynn
Pitman-Dunn Laboratory
Frankford Arsenal
Philadelphia, Pennsylvania


ABSTRACT.  In connection with a modernization program on the manufacture
of small caliber ammunition, it was recognized that ballistic pressure
measurements would have to be automated in order to keep pace with
increased rates of production. Copper crusher pressure gages with
individual measurements of compressed cylinders would be too slow. This
paper deals with a feasibility study on the use of electrical resistance
strain gages for quality assurance testing of ammunition.

Although the method of using external strain gages to determine internal
ballistic pressures is well known, a new arrangement of strain gages was
developed to measure directly the quantity $(\epsilon_\theta + \nu \epsilon_z)$
on the outer surface of a test barrel (where $\epsilon_\theta$, $\epsilon_z$ are
the circumferential and longitudinal strains, respectively, and $\nu$ is
Poisson's ratio). From Hooke's law, the combined strain signal was
proportional to the circumferential stress in the barrel at the outer
surface. Using Lamé's solution, this stress was related to the internal
pressure. Thus, the strain gaged test barrel acted as its own pressure
transducer.

Experiments were designed to compare the results of ballistic firings
with three types of ammunition, two test barrels, and pressures at several
locations. It was concluded that the strain gage method is feasible for
acceptance testing of ammunition.
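
The chain from combined strain to internal pressure described in the abstract
can be sketched as follows. This is a minimal illustration and not the
author's data reduction: it assumes a linearly elastic, thick-walled barrel
with inner radius a and outer radius b, a stress-free outer surface (radial
stress zero there, so plane-stress Hooke's law applies), and static loading.
The function names and all numerical values are hypothetical.

```python
def hoop_stress_outer(eps_theta, eps_z, youngs_modulus, poisson_ratio):
    """Circumferential stress at the outer surface from the measured strains.

    Plane-stress Hooke's law at a free surface:
        sigma_theta = E / (1 - nu^2) * (eps_theta + nu * eps_z)
    """
    return youngs_modulus / (1.0 - poisson_ratio ** 2) * (
        eps_theta + poisson_ratio * eps_z
    )


def internal_pressure(sigma_theta_outer, inner_radius, outer_radius):
    """Internal pressure from the outer-surface hoop stress.

    Lame's solution for internal pressure p gives, at r = b,
        sigma_theta(b) = 2 p a^2 / (b^2 - a^2),
    which is inverted here for p.
    """
    a2 = inner_radius ** 2
    b2 = outer_radius ** 2
    return sigma_theta_outer * (b2 - a2) / (2.0 * a2)


# Hypothetical steel barrel and strain readings (SI units).
E = 200e9          # Young's modulus, Pa (assumed)
nu = 0.3           # Poisson's ratio (assumed)
a, b = 0.004, 0.012  # inner and outer radii, m (assumed)

sigma = hoop_stress_outer(eps_theta=300e-6, eps_z=50e-6,
                          youngs_modulus=E, poisson_ratio=nu)
print(internal_pressure(sigma, a, b))
```

Because the combined signal is proportional to the pressure, a single
calibration constant E(b^2 - a^2) / [2 a^2 (1 - nu^2)] converts the gage
output directly to pressure under these assumptions.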

NOTE.  Published by the Society for Experimental Stress Analysis in
its journal Experimental Mechanics, Volume 15, 1975.
STATISTICAL INVESTIGATION INTO PULSE CHARGING
OF NICKEL-CADMIUM BATTERIES

Walter Kasian
Maintenance Directorate
and
Erwin Biser
Avionics Laboratory
U. S. Army Electronics Command
Fort Monmouth, New Jersey

ABSTRACT.  The common methods of charging vented
aircraft nickel-cadmium batteries are constant current,
constant potential and modified constant potential (current
limited). However, through continuous recharging by these
methods, nickel-cadmium batteries develop a "memory effect"
caused by passivation of the battery's positive cell plate
material (nickel-oxide) and "fadeout" caused by crystal
growth of the negative cell plate material (cadmium). These
two phenomena gradually and continually lessen battery
charge acceptance, which in turn lessens the battery output.

Pulse charging, however, has shown a significant
effect in eliminating battery "fadeout" and "memory effect".
Thus pulse charging can eliminate the required periodic
cycling to rejuvenate the batteries and possibly increase
the battery cycle life. The pulse charging of nickel-cadmium
batteries has been completed on two new and two used batteries
in all possible combinations of the following charge
variables: three different pulse amplitudes, three different
charge rates and two different per-cent overcharge rates.

This investigation entails analysis of the mean
response (battery output) and response variability to
determine the optimum combination of pulse amplitude,
charge rate and per-cent overcharge in charging new and
used nickel-cadmium batteries. Similar analysis is
performed to determine the optimum combination of the variables
for greatest battery efficiency.

1.  INTRODUCTION.  The Army nickel-cadmium battery
on which this investigation was performed is designated
the BB-433A/U, is rechargeable and is rated at 34 ampere-hours;
that is, it is capable of supplying 34 amperes of
current at a constant rate for 1 hour at a nominal voltage
of 24 volts. This is just over 800 watts of power. The
battery is used primarily to start Army aircraft and to
supply power to airborne electronics equipment and the
Vulcan Air Defense System.

An investigation was performed on this battery because
of its high density (approximately 5,000 of these batteries
are deployed) and because of the severe maintenance
problems encountered with these batteries.
After every 100 hours of use, these batteries have to come
into maintenance shops for reconditioning, which can take
anywhere from 20 to 50 hours if the batteries are good.

The periodic maintenance is necessitated by the
fact that the present recharging techniques, constant
current and modified constant potential, whether in a
battery operating system or maintenance shop, gradually
lessen battery charge acceptance, which in turn lessens
battery output. Pulse charging, however, eliminates the
maladies associated with constant current or constant
potential charging, i.e., fadeout (cadmium crystal growth on the
battery negative plates) and memory effect (passivation of
the nickel-oxide on the battery positive plates).

The three different charging methods are depicted in
Figure 1. As constant current implies, a constant current
is applied to the battery for a specified time, usually
limited to the time that the battery receives its rated
capacity. In modified constant potential, the battery
draws current until it is charged to a certain specified
voltage. The maximum current is usually limited to the
battery rating. In pulse charging, the input pulsed current
can be of any amplitude with an average input value, as in
this case, up to the battery rating of 34 ampere-hours.

The pulse charging of the BB-433A/U is depicted in
Figure 2. The average main current input at any pulse
amplitude is applied for a time T. This is the time in which
the battery receives approximately 100% of its capacity.
It is then overcharged at one-third the main current for a
certain % of the time it underwent its main charge, in
this case 20% and 40% overcharge. Because the battery
is not 100% efficient, overcharging is required so that
we can obtain at least 100% of the battery's rated output
on discharge. The Automatic Pulse Charger, Model 3000A,
developed by Utah Research and Development Company
specifically for the Army, was employed for all the pulse
chargings in this investigation.

After charging and a four hour rest period, the
batteries were discharged at half their rated capacity,
17 amperes, to a battery end voltage of 19.2 volts, the
point at which 100% of the battery's capacity has been
removed. All charging and discharging was monitored by
external test equipment to provide data accuracy to 1%.
The BB-433 discharge characteristic is depicted in Figure 3.

2.  STRUCTURE OF THE MODEL.  This experiment consisted
of the following factors: A = Peak Pulse, B = Charge Current,
C = % Topping or Overcharge and D = Battery Type (new or used).
The factors and the levels of those factors are described in
Figure 4. Since the levels of the factors are at fixed
values, the model is thus a fixed model. The values of the
output (ampere-hours) and efficiencies obtained from
BB-433's of two different manufacturers (batteries #1 and
#2 of manufacturer X and batteries #3 and #4 of manufacturer
Y) are shown in Figures 5 through 8. Two observations
were obtained for each combination of the different levels.
Efficiency was calculated as the ampere-hour output divided
by the ampere-hour input for each battery charge-discharge
cycle.

The mathematical model adopted can be expressed as
follows:

$$ X_{IJKL(M)} = U + A_I + B_J + C_K + D_L + AB_{IJ} + BC_{JK} + CD_{KL} + AC_{IK} + AD_{IL} + BD_{JL} + ABC_{IJK} + \cdots + ABCD_{IJKL} + E_{IJKL(M)} $$

$$ I, J = 0, 1, 2; \qquad K, L = 0, 1 $$

where $X_{IJKL(M)}$ = observed random variable (output or efficiency)

$U$ = grand average or effect due to the mean
$A_I, B_J, C_K, D_L$ = effect due to main effects
$AB_{IJ}, \ldots, ABCD_{IJKL}$ = effect due to interactions
$E_{IJKL(M)}$ = random error

Hypotheses tested:

$H_1$: $A_I = 0$ for all I
$H_2$: $B_J = 0$ for all J
$H_3$: $C_K = 0$ for all K
$H_4$: $D_L = 0$ for all L
$H_5$: $AB_{IJ} = 0$ for all I and J

Similarly for the interactions $BC_{JK}$, $CD_{KL}$, etc.
Consideration was given to confounding the 3² × 2² design.
Since we are limited to two observations per cell, we have
altogether only seventy-two (72) observations. Confounding
would introduce unwarranted complications and would be
counterproductive. Confounding was thus avoided.
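
The bookkeeping behind the seventy-two observations can be checked with a
short sketch. It enumerates the cells of the factorial layout and counts
degrees of freedom for the main effects and the two- and three-factor
interactions; the remainder is what an analysis of variance table would list
as residual if the four-factor interaction is pooled with replication error
(an assumption made here for illustration). Variable names are hypothetical.

```python
from itertools import combinations, product
from math import prod

# Number of levels per factor: A and B at three levels, C and D at two.
levels = {"A": 3, "B": 3, "C": 2, "D": 2}
replicates = 2  # two observations per cell

# Every combination of factor levels is one cell of the design.
cells = list(product(*(range(n) for n in levels.values())))
n_observations = len(cells) * replicates
total_df = n_observations - 1


def effect_df(term):
    """Degrees of freedom of a main effect or interaction, e.g. ("A", "B")."""
    return prod(levels[f] - 1 for f in term)


# Main effects plus all two- and three-factor interactions.
terms = [t for size in (1, 2, 3) for t in combinations(levels, size)]
effects_df = sum(effect_df(t) for t in terms)

# Four-factor interaction and pure replication error, pooled as residual.
abcd_df = effect_df(tuple(levels))
pure_error_df = len(cells) * (replicates - 1)
residual_df = total_df - effects_df

print(len(cells), n_observations, total_df, effects_df, residual_df)
```

Under this pooling assumption the residual carries the four-factor
interaction together with the replication error.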


The results of the analysis of variance are shown in
Figures 9 through 12. The results indicate that for the
output of batteries #1 and #2, the only significant
difference exists in the interaction between factors C (%
overcharge) and D (battery condition); the critical value is
F = 2.83. For the efficiency of batteries #1 and #2, defined
as output divided by input in ampere-hours, times 100, the
interaction between factors C and D was significant and the
main effect C was overwhelmingly significant.

For the output of batteries #3 and #4, significant
differences existed in the main effect C, main effect D and
the interaction between effects C and D; the critical value is
F = 2.83. For the efficiencies of batteries #3 and #4, main
effect C (% overcharge) and the interaction between effects B
(charge rate), C (% overcharge) and D (battery type) showed
significant differences at the 95% level.

Though the F test performed above may reject the null
hypothesis that the means are equal, it does not tell us which
means are significantly different from which. Scheffe
proposed a procedure for this problem which we
shall employ.


3.  SCHEFFE METHOD FOR MULTIPLE COMPARISON OF MEANS.

Suppose we have estimates $\bar{x}_1, \ldots, \bar{x}_k$ of true means
$\mu_1, \ldots, \mu_k$, with variances $\sigma^2 / n_i$, $\sigma^2$ being
estimated by $s^2$ with f degrees of freedom. We are interested in
contrasts, defined as

$$ \theta = \sum_{i=1}^{k} c_i \mu_i, \qquad \text{where} \quad \sum_{i=1}^{k} c_i = 0. \tag{3.1} $$

The contrast $\theta$ is estimated as

$$ H = \sum_{i=1}^{k} c_i \bar{x}_i \tag{3.2} $$

with variance

$$ V(H) = \sigma^2 \sum_{i=1}^{k} \frac{c_i^2}{n_i}. $$

The estimated variance of H is

$$ \hat{V}(H) = s^2 \sum_{i=1}^{k} \frac{c_i^2}{n_i}. \tag{3.3} $$
Scheffe's result is that we can construct $(1 - \alpha)$ confidence
limits for all possible contrasts $\theta$,

$$ \Pr\left\{ H - S\sqrt{\hat{V}(H)} < \theta < H + S\sqrt{\hat{V}(H)} \right\} = 1 - \alpha \tag{3.4} $$

where

$$ S^2 = (k - 1)\, F_{1-\alpha}(k - 1, f). \tag{3.5} $$

If for each experiment we perform, we construct confidence
limits according to (3.4), then, in a fraction $(1 - \alpha)$
of these experiments, all the confidence statements will be
correct; in a fraction $\alpha$, one or more of the statements will
be incorrect. The Scheffe method, furthermore, allows for
comparisons of means when the numbers of observations, $n_i$, for
the means are different.


Example: Let us suppose the following means are obtained:
$\bar{x}_1 = 24$; $\bar{x}_2 = 22$; $\bar{x}_3 = 21$; $\bar{x}_4 = 17$;
$\bar{x}_5 = 16$, with n = 4 replications and k = 5 treatments,
$s^2 = 4.5$ and number of degrees of freedom = 12. Suppose we wish to
test the contrast $\bar{x}_1, \bar{x}_2, \bar{x}_3$ versus
$\bar{x}_4, \bar{x}_5$. Therefore

$$ H = \sum c_i \bar{x}_i = 2\bar{x}_1 + 2\bar{x}_2 + 2\bar{x}_3 - 3\bar{x}_4 - 3\bar{x}_5 = 35 \qquad \left( \sum c_i = 0 \right) $$

and

$$ \hat{V}(H) = \left( 2^2 + 2^2 + 2^2 + (-3)^2 + (-3)^2 \right) s^2 / n = (30)(4.5)/4 = 33.75 $$

and therefore $\sqrt{\hat{V}(H)} = 5.81$. To determine if a contrast
is significant, we wish to know if $|H| / \sqrt{\hat{V}(H)} > S$, where
$S^2$ is described in equation 3.5. Since
$S = \sqrt{(k-1) F_{.95}(4, 12)} = 3.61$ and
$|H| / \sqrt{\hat{V}(H)} = 6.02$, we have 6.02 > 3.61 and therefore the
contrast is significant. The 95% confidence interval for the
true value of the contrast, $\theta$, can be constructed using equation
3.4. For our example the 95% confidence interval for the true
value of the contrast, $\theta$, will be $14.03 < \theta < 55.97$. Thus the
difference between $\bar{x}_1, \bar{x}_2, \bar{x}_3$ and
$\bar{x}_4, \bar{x}_5$ will lie within the
interval just calculated, at 95% confidence.
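
The arithmetic of the worked example can be reproduced with a short sketch.
The function name scheffe_contrast is hypothetical, and the tabulated
critical value F.95(4, 12) = 3.26 is supplied as an input rather than
computed, so no statistics library is assumed.

```python
from math import sqrt


def scheffe_contrast(means, coeffs, counts, s2, f_crit):
    """Evaluate one Scheffe contrast.

    means  : treatment means
    coeffs : contrast coefficients c_i (must sum to zero)
    counts : number of observations n_i behind each mean
    s2     : pooled estimate of the error variance
    f_crit : tabulated F(1 - alpha; k - 1, f), supplied by the caller
    Returns (H, estimated variance of H, S, (lower, upper)).
    """
    if abs(sum(coeffs)) > 1e-12:
        raise ValueError("contrast coefficients must sum to zero")
    k = len(means)
    h = sum(c * m for c, m in zip(coeffs, means))                # eq. (3.2)
    var_h = s2 * sum(c * c / n for c, n in zip(coeffs, counts))  # eq. (3.3)
    s = sqrt((k - 1) * f_crit)                                   # eq. (3.5)
    half_width = s * sqrt(var_h)
    return h, var_h, s, (h - half_width, h + half_width)         # eq. (3.4)


# Values from the example in the text; 3.26 is the tabulated F.95(4, 12).
h, var_h, s, (lower, upper) = scheffe_contrast(
    means=[24, 22, 21, 17, 16],
    coeffs=[2, 2, 2, -3, -3],
    counts=[4] * 5,
    s2=4.5,
    f_crit=3.26,
)
significant = abs(h) / sqrt(var_h) > s
print(h, var_h, round(s, 2), round(lower, 2), round(upper, 2), significant)
```

Carrying full precision through the calculation gives interval limits that
differ from the rounded values in the text only in the second decimal place.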


Figures 13 through 16 show the means, $\bar{x}$, standard
deviations, $\sigma$, and the number of observations, n, of the different
levels for both the output and efficiency of batteries #1 through
#4. These means are employed in the Scheffe method to evaluate the
estimate, H, of the contrast and construct the 95% confidence
limits for several contrasts. The results of the Scheffe method
are given in Figures 17 and 18 for batteries #1 and #2 and
batteries #3 and #4, respectively. Batteries #1 and #2 were
manufactured by General Electric and batteries #3 and #4 were
manufactured by Sonotone Inc. The asterisked contrasts, H, in
Figures 17 and 18 indicate significant differences at 95%; also,
the 95% confidence limits for the contrasts are given.
4.  CONCLUSIONS.  For batteries #1 and #2 (General
Electric), the Scheffe results, Figure 17, indicate slight
difference, on the average, in output between combinations
of contrasts; however, a significant difference exists in
the efficiencies as evidenced, in particular, in contrasts
1 and 2. Thus, analyzing the Scheffe results, one can infer
that for pulse charging the use of 40% overcharge does not
significantly increase battery output but is significantly
less efficient than 20% overcharging, thus indicating
large charge current losses through conversion of charge
energy into heat. No significant effect of factor A (pulse
amplitude) or factor B (charge current) was noticed on
battery output or efficiency.

For batteries #3 and #4 (Sonotone), the Scheffe results
for pulse charging, Figure 18, indicate a significantly
higher average output at 40% overcharge than at 20% for
identical contrasts as for the General Electric batteries,
but still the 40% overcharge was significantly less
efficient than 20% overcharge. Also, the new Sonotone
battery had a better charge acceptance and therefore greater
output than the old battery (contrasts [illegible] and 6); yet,
the old battery had an average output of 40.70 ampere-hours
(Figure 15, level d1). Again, factor A (pulse amplitude)
and factor B (charge current) had no significant effect on
battery output or efficiency.

In comparison, for the General Electric batteries the
grand mean for the output was 38.16 ampere-hours and the
efficiency was 76.87%, while for the Sonotone batteries the
grand mean for the output was 42.04 ampere-hours and the
efficiency was 74.67%. (The nominal rating of these
batteries is 34.0 ampere-hours.)

In summary, for pulse charging, the General Electric
batteries, on the average, provided 12.2% greater output
than their rating, while the Sonotone batteries provided 23.6%
greater output. No noticeable battery output degradation was
observed as each battery underwent 36 random pulse charge-discharge
cycles (Figures 5 and 7). Based on the F test, the Scheffe
results, design practicality, the need for a quick battery
recharge time, and from the standpoint of energy conservation,
the optimum pulse charging levels would be 100 amperes peak
pulse, 34 amperes main/11.3 amperes overcharge current at a
20% overcharge rate.

Future tests should be conducted to determine individual
recharging effects of these optimum pulse charging levels on
BB-433 batteries after they have undergone the present field
service recharge conditions (constant current, etc.).
[Figure: TYPICAL CHARGING METHODS]

[Figure labels: Time of Main Charge; Per Cent Topping]


Factors and Levels of
Pulse Charging Experiment


A = Peak Pulse
    a0 = 100 Amperes
    a1 = 150 Amperes
    a2 = 200 Amperes

B = Charge Current (Amps) *
    b0 = 17/5.7
    b1 = 25.5/8.5
    b2 = 34/11.3

C = % Topping
    c0 = 20%
    c1 = 40%

D = Battery Type
    d0 = New
    d1 = Used


* The charge current is divided into main/overcharge
  current, e.g., b0 = 17/5.7 indicates 17 amperes
  main charge and 5.7 amperes overcharge current.


FIGURE 4
FIGURE 6

[Figure 6 data not legible; surviving labels: 200 amps, 34/11.3 amps]


Analysis of Variance Table
(Batteries #1 and #2, Output)


SOURCE      D.F.         SS           MS        F RATIO

A             2        53.7245      26.8623      1.4902
B             2         5.5787       2.7894      0.1547
C             1        10.8190      10.8190      0.6002
D             1        35.9128      35.9128      1.9922
AxB           4        42.1819      10.5455      0.5850
AxC           2        64.5722      32.2861      1.7910
AxD           2        75.4045      37.7023      2.0915
BxC           2        15.8343       7.9172      0.4392
BxD           2         3.0390       1.5195      0.0843
CxD           1       103.4641     103.4641      5.7396
AxBxC         4        55.4999      13.8750      0.7697
AxBxD         4        61.6804      15.4201      0.8554
AxCxD         2        19.5804       9.7902      0.5431
BxCxD         2        22.2550      11.1275      0.6173
RESIDUAL     40       721.0592      18.0265
TOTAL        71      1290.6059


FIGURE 9
Analysis of Variance Table
(Batteries #1 and #2, Efficiency)


SOURCE      D.F.         SS           MS        F RATIO

A             2        82.4559      41.2280      0.8743
B             2        82.3685      41.1843      0.8734
C             1      9595.5113    9595.5113    203.4924
D             1         5.9111       5.9111      0.1254
AxB           4       115.6990      28.9248      0.6134
AxC           2       146.2396      73.1198      1.5507
AxD           2       266.2553     133.1277      2.8232
BxC           2        44.7423      22.3712      0.4744
BxD           2         0.3031       0.1516      0.0032
CxD           1       228.1604     228.1604      4.8386
AxBxC         4       183.2613      45.8153      0.9716
AxBxD         4       132.6138      33.1535      0.7031
AxCxD         2        48.8554      24.4277      0.5180
BxCxD         2       346.9326     173.4663      3.6787
RESIDUAL     40      1886.1666      47.1542


Analysis of Variance Table
(Batteries #3 and #4, Output)


SOURCE      D.F.         SS           MS        F RATIO

A             2        12.4191       6.2096      0.3571
B             2        16.9933       8.4967      0.4886
C             1       507.2643     507.2643     29.1695
D             1       129.7398     129.7398      7.4605
AxB           4        51.1557      12.7889      0.7354
AxC           2         7.7669       3.8835      0.2233
AxD           2         3.5659       1.7830      0.1025
BxC           2        38.1966      19.0983      1.0982
BxD           2        25.1606      12.5803      0.7234
CxD           1       324.1483     324.1483     18.6397
AxBxC         4        41.3617      10.3404      0.5946
AxBxD         4        17.4804       4.3701      0.2513
AxCxD         2         9.5661       4.7831      0.2750
BxCxD         2        75.7229      37.8615      2.1772

Analysis of Variance Table
(Batteries #3 and #4, Efficiency)


SOURCE      D.F.         SS           MS        F RATIO

A             2       197.5?          ?          1.2413
B             2       154.7335      77.3668      0.9721
C             1      3770.0139    3770.0139     47.3698
D             1        50.2002      50.2002      0.6308
AxB           4        20.9902       5.2476      0.0659
AxC           2       117.0239      58.5120      0.7352
AxD           2         7.8071       3.9036      0.0490
BxC           2        56.0922      28.0461      0.3524
BxD           2       168.7832      84.3916      1.0604
CxD           1         1.8688       1.8688      0.0235
AxBxC         4       341.4617      85.3654      1.0726
AxBxD         4       359.2192      89.8048      1.1284
AxCxD         2       162.7353      81.3676      1.0224
BxCxD         2       654.1355     327.0678      4.1096
RESIDUAL     40      3183.4776      79.5869
TOTAL        71      9246.1302

(? = digits not legible in the available copy)
Mean Output Response
(Ampere-Hours)
(Batteries #1 and #2)


                                 Standard
Level       X         σ         Error of X       N

a0        39.33      5.73         1.17           24
a1        37.97      4.01         0.82           24
a2        37.?5      2.02         0.45           24
b0        38.54      5.17         1.06           24
b1        38.1?      5.15         1.05           24
b2        37.86      1.62         0.33           24
c0        37.79      3.57         0.60           36
c1        38.57      4.68         0.81           36
d0        37.48      5.04         0.84           36
d1        38.83      3.?7         0.55           36

Grand mean = 38.16

(? = digit not legible in the available copy)


FIGURE 13
Mean Efficiency (%) Response
(Batteries #1 and #2)


                                 Standard
Level       X         σ         Error of X       N

a0        78.23     12.58         2.57           24
a1        76.?5     16.1?         3.29           24
a2        75.62     12.28         2.51           24
b0        78.37     16.04         3.27           24
b1        76.26     11.?6          ?             24
b2        75.96     12.76         2.65           24
c0        88.41      7.07         1.18           36
c1        65.32      7.22         1.20           36
d0        77.15     11.99         2.00           36
d1        76.58     15.24         2.54           36

Grand mean = 76.87

(? = digit or entry not legible in the available copy)
Mean Output Response
(Ampere-Hours)
(Batteries #3 and #4)


                                 Standard
Level       X         σ         Error of X       N

a0        42.63      6.19         1.26           24
a1        41.76      4.55         0.93           24
a2        41.73      5.05         1.03           24
b0        42.18      5.49         1.12           24
b1        42.55      5.79         1.18           24
b2        41.3?      4.55         0.93           24
c0        39.39      5.48         0.91           36
c1        44.70      3.36         0.56           36
d0        43.36      2.20         0.37           36
d1        40.70      6.88         1.15           36

Grand mean = 42.04

(? = digit not legible in the available copy)
Mean Efficiency (%) Response
(Batteries #3 and #4)


                                 Standard
Level       X         σ         Error of X       N

a0        73.?6     11.66         2.33           24
a1        73.53     10.87         2.22           24
a2        77.01     11.81         2.41           24
b0        72.77     11.64         2.38           24
b1        75.30     12.61         2.57           24
b2        74.69     13.09         2.67           24
c0        81.90      8.00         1.33           36
c1        67.43      9.61         1.60           36
d0        75.50     11.44         1.91           36
d1        73.83     11.49         1.91           36

Grand mean = 74.67

(? = digit not legible in the available copy)
SCHEFFE RESULTS (GENERAL ELECTRIC, BATTERIES #1 and #2)


FIGURE 17


SCHEFFE RESULTS (SONOTONE, BATTERIES #3 and #4)


* Significant contrast
OPTICAL CHARACTERIZATION OF SURFACE ROUGHNESS

Eugene L. Church and John M. Zavada
Pitman-Dunn Laboratory
Frankford Arsenal
Philadelphia, Pennsylvania


ABSTRACT.  There is considerable Army interest in developing non-contact
techniques for quantifying surface roughness. Here we consider light
scattering as a tool for deducing statistical properties of surface
microtopography with the aid of a suitable electromagnetic scattering
formalism. For slightly rough surfaces the solution to the inverse
scattering problem may be written in terms of the power spectrum of the
surface roughness viewed through a window covering the nominal wavenumber
range from the reciprocal wavelength of light on one extreme, to the
reciprocal diameter of the probing beam on the other. As illustrations of
the method we consider the characterization of two types of residual
roughness on laser mirror surfaces: one-dimensional periodic roughness left
by single-point diamond turning, and isotropic random roughness left by more
conventional polishing techniques.

1. INTRODUCTION. The Army manufactures many high-quality optical components
for laser and passive systems. The performance and usability of these
components depend critically on their surface quality and structure. There is
a need for quick and meaningful ways of testing these surfaces during manufacture
and before use. Surface microroughness is a very important parameter in
determining the quality of such surfaces, and this paper reviews the background
and design of experiments to explore light scattering as a tool for measuring
such surface microroughness.

Figure 1 indicates this need more fully. There are two principal methods
of characterizing the residual microroughness of optical surfaces now in use:
visual observation and stylus measurement. However, these methods have
limitations as shown. What is needed is a method that is fast and objective,
which can be used by unskilled personnel or automated, and which can measure
roughnesses in the submicroinch range - down to 10 to 100 Å rms. Light
scattering, as we will describe it, offers a possible means for doing this.

Figure 2 shows an artist's conception of how such a light-scattering device
might work. It consists of a laser light source on the left, a photomultiplier
detector on the right, and the sample under test in between. The laser light
reflects and scatters from the sample, but a mask over the detector blocks out
the specularly reflected light and only allows the scattered light to reach the
sensitive area of the detector. The output of the detector goes to a meter. A
good surface will scatter little light and give a low reading; a bad surface will
give more scattering and a high reading. In principle, the meter reading can be
related quantitatively to the roughness parameters of the surface, such as the
surface variance.

Although it is easy to conceive of a device such as this, we do not know
enough about the roughness characteristics of real surfaces at this time to
make such a device meaningful or reliable.

The purpose of this paper is to describe the design of a series of
experiments which we are setting up at Frankford Arsenal to generate the
necessary data base on various types of real surfaces, to permit us to design,
build, and use light-scattering devices for surface test and evaluation.¹

2. LIGHT SCATTERING. We have chosen light scattering as a technique
because it provides a functional test of surface performance; it is an extremely
sensitive way of measuring small deviations of an optical medium from its average
behavior. The most familiar example of this is Rayleigh scattering, which
results from the scattering of sunlight from microparticles and density
fluctuations in the atmosphere, and is responsible for the color of the blue sky
and red sunset. The scattering we will consider today differs from simple
Rayleigh scattering in two ways: first, we consider the random deviations from a
plane surface - a two-dimensional mirror - rather than a three-dimensional
volume; and second, we include the possibility that the adjacent scattering
centers are correlated with each other. In effect, then, we will be examining
"opalescent-scattering" effects of the surface layer.²

Figure 3 sketches some of the underlying physics involved in the
light-scattering process. Consider light incident on a sinusoidal, grating-like
surface. In this case the scattered light is bunched into a series of discrete
diffraction orders, whose angular positions are determined by the familiar
grating equation shown, where $\theta_i$ is the angle of incidence and $\theta_s$
is the angle of scattering or diffraction. In the case of normal incidence,
$\theta_i = 0$, and the grating equation reduces to the form shown on the second
line, where the integer $m = \pm 1, \pm 2, \ldots$ is the diffraction order.

The positions of the various diffracted orders are independent of the depth
of the grating, but their intensities depend strongly on the depth. For a weak
grating - one where the depth is much less than the wavelength of light, which
is the case of interest here - only the two first-order diffraction lines
($m = \pm 1$) appear with any significant intensity. They appear at the symmetric
angles shown, and their intensities relative to the incident intensity - the
grating efficiency - are shown below in the Figure.¹ Here k is the wavenumber
of the incident light and $\sigma^2$ is the variance of the surface roughness.

We now see the shape of things to come - there are two roughness length
scales in the scattering problem: vertical and horizontal. The vertical
roughness scale determines the intensity of the scattering, while the horizontal
roughness scale determines its angular distribution.

Real surfaces are not simple sinusoidal gratings. However, we can generate
realistic surfaces by making a Fourier superposition of elemental gratings such
as considered here; that is, by summing the scattering due to a large number of
gratings with various wavelengths running in various directions over the
two-dimensional surface. In the case where the rms depth of the grating is much
less than the wavelength of light, the scattering of such a composite surface is
simply proportional to the two-dimensional spectral density of the surface
roughness. This is true whether the surface is described statistically or
deterministically.

Figure 4 sketches the notation we will use in describing an arbitrarily
rough surface. The average surface is the x-y plane, and $\zeta(x,y)$ is the
deviation of the real surface from that average, with variance $\sigma^2$. The
power spectrum, W, is the average square of the two-dimensional Fourier
transform of $\zeta$, and is itself the two-dimensional transform of the surface
autocorrelation function A. If the roughness is described as a stationary random
function, A is then a function of the separation parameter, $\rho$. Specific
examples of W and A are given in a later figure.
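
The statement that W is both the squared Fourier transform of the surface and the transform of its autocorrelation function can be checked numerically. The following is a minimal sketch, assuming a height map sampled on a small periodic grid and using unnormalized discrete analogues of W and A; the function names and the random test surface are illustrative and do not come from the paper.

```python
import numpy as np

def power_spectrum(z):
    """Unnormalized discrete power spectrum |FFT2(z)|^2 of a height map z."""
    return np.abs(np.fft.fft2(z)) ** 2

def circular_autocorrelation(z):
    """Direct circular autocorrelation A[m, n] = sum_{x,y} z[x, y] * z[x+m, y+n]."""
    nx, ny = z.shape
    a = np.empty((nx, ny))
    for m in range(nx):
        for n in range(ny):
            shifted = np.roll(z, shift=(-m, -n), axis=(0, 1))
            a[m, n] = np.sum(z * shifted)
    return a

rng = np.random.default_rng(0)
z = rng.normal(size=(16, 16))    # hypothetical rough surface (arbitrary units)
z -= z.mean()                    # measure heights from the average surface

W = power_spectrum(z)
A = circular_autocorrelation(z)

# Transform of the autocorrelation equals the power spectrum.
wk_holds = np.allclose(np.fft.fft2(A).real, W)

# Summing the spectrum recovers the surface variance (Parseval's theorem).
variance_from_W = W.sum() / z.size**2
```

The last line mirrors the later remark that the integral over W is just the surface variance.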

Figure 5 gives the form for the differential scattering intensity of a
rough surface in terms of its power spectrum, W, for the illustrative case of
unpolarized radiation normally incident on a perfect conductor.³ The scattering
intensity is proportional to $k^4$ - which reflects its relationship to Rayleigh
scattering. The factor $1 + \cos^2\theta$ is the polarization factor for
electric-dipole scattering, which helps honey bees find their way home to the
hive in the blue-sky version of this formula. The final factor is the
two-dimensional power spectrum of the roughness, which contains all the
information about the surface that is necessary to predict the scattering. It is
a function of the two wavenumbers, p and q. However, for an isotropic surface W
is a function only of the Pythagorean combination of the two wavenumbers p and
q, $\sqrt{p^2 + q^2}$, which equals $k \sin\theta$ - which is, in turn, the
transverse momentum imparted to the scattered photons by the surface roughness.

The top formula in Figure 5 gives the differential scattering per unit solid
angle. The total integrated scatter, or TIS, is the integral of this over the
whole hemisphere. In the case where the scattering occurs principally at small
angles, $\theta$, the TIS can be written in terms of the integral over W itself,
which is just the surface variance, $\sigma^2$. In that case the TIS assumes the
simple form given on the bottom of the Figure. This well-known formula is often
used to estimate the variance of the surface roughness from measurements of the
TIS, using an integrating light sphere. In the experiments we are considering,
however, we will look at the differential light scattering intensity, which
gives information about the form of W itself and not just its integral.

The expression for the TIS given on the bottom of Figure 5 has another
significance in the present context - it is the expansion parameter used in the
perturbation theory which is used to derive the form for the scattering intensity
given on the first line. In other words, the TIS must be $\ll 1$ for this form to
be valid. This is the mathematical definition of a slightly-rough or a
weakly-rough surface as we use it here. In practice the TIS is of the order of 1%
or less, which is one of the reasons we choose to look at the scattered light
directly, rather than the corresponding reduction of the specular intensity.
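
The TIS formula itself appears only in Figure 5, which is not reproduced in this text. As a minimal sketch, assuming it is the standard small-roughness, normal-incidence expression TIS = (4πσ/λ)² = (2kσ)², the size of the expansion parameter can be evaluated for the 50 Å rms roughness and the two laser wavelengths quoted in Section 4. The function name and unit constants are illustrative.

```python
import math

def total_integrated_scatter(sigma, wavelength):
    """Assumed small-roughness TIS at normal incidence: (4*pi*sigma/wavelength)**2.
    sigma and wavelength must be given in the same length unit."""
    return (4.0 * math.pi * sigma / wavelength) ** 2

ANGSTROM = 1e-10   # meters
MICRON = 1e-6      # meters

tis_hene = total_integrated_scatter(50 * ANGSTROM, 0.6328 * MICRON)
tis_co2 = total_integrated_scatter(50 * ANGSTROM, 10.6 * MICRON)
```

Under this assumption the HeNe value comes out at roughly 1%, in line with the "about 1%" quoted for the example surfaces, and the CO2 value is smaller by the square of the wavelength ratio.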

The results shown here are a special case of a more general electromagnetic
scattering formalism which was originally derived for radar scattering from the
surface of the sea and various terrains. That general formalism includes the
dependence of the scattering on the angle of incidence, the initial and final
polarizations, and the complex dielectric response of the surface material. The
only change that we need make in going from those radar results to our optics
problem is to change the wavelength scale from meters to microns, and to use the
appropriate optical-frequency dielectric response of the surfaces under study.⁴

3. PARTICULAR SURFACES. Figure 6 gives the power spectra for two idealized
optical surfaces: a random, isotropically rough surface, and a deterministic,
periodic surface. In the case of the randomly-rough surface we derive the form
of W from the exponential autocorrelation function shown, where $\ell$ is the
transverse autocovariance length. This form is suggested by the Ornstein-Zernike
analysis of critical opalescence. W is then the two-dimensional Lorentzian shown.
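
Since Figure 6 is not reproduced here, the following is a reconstruction of what the exponential autocorrelation function and its two-dimensional transform look like, offered as a sketch rather than as the authors' exact expressions. The normalization (chosen so that the integral of W over the p-q plane equals $\sigma^2$) is an assumption.

```latex
A(\rho) = \sigma^2 \, e^{-\rho/\ell},
\qquad \rho = \sqrt{x^2 + y^2},
\\[4pt]
W(p, q) = \frac{1}{(2\pi)^2} \iint A(\rho)\, e^{-i(px + qy)} \, dx\, dy
        = \frac{\sigma^2 \ell^2}{2\pi}\,
          \frac{1}{\left[\,1 + (p^2 + q^2)\,\ell^2\,\right]^{3/2}} .
```

For $(p^2 + q^2)\,\ell^2 \gg 1$ this form decays as the inverse cube of the transverse wavenumber, which is consistent with the inverse-cube angular fall-off described in Section 4.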

For the purpose of designing our experiments, this first form for W is taken
to represent the surface generated by random polishing techniques. It predicts a
continuous distribution of scattered intensity peaked about the specularly
reflected beam. The longer the correlation length $\ell$, the sharper the
peaking; until finally, when $\ell$ becomes of the order of the size of the
probing beam spot, the scattered light collapses into the diffraction cone of the
specularly reflected beam. Conversely, the shorter the correlation length, the
broader the scattering distribution, until in the limit where $\ell \ll \lambda$,
the results go over into the simple Rayleigh scattering from a layer of
independent scattering centers lying on the surface.

The second form of W shown is a deterministic form representing a periodic,
corrugated surface, expanded in a Fourier series. The corresponding power
spectrum is then a sum of products of various delta functions corresponding to
the standing waves of the fundamental and harmonics that make up the corrugations.
This type of roughness does not lead to a continuous scattering distribution, but
rather to a series of diffraction peaks - one peak for each harmonic - at the
positions determined by the grating equation. In the special case of a single
sine-wave component, these results reduce exactly to those of the simple grating
already given in Figure 3. For the purpose of designing experiments, this second
form of W is taken to represent the residual roughness on a surface generated by

3 

the  single-point  diamond  turning. 


One important feature of slightly-rough scattering is illustrated by this
second form for the power spectrum W: the scattering is insensitive to the signs
and phases of the original Fourier decomposition. This means that even in the
case of a deterministic surface, the most careful measurement of the scattered
intensity will still not allow us to solve the inverse scattering problem exactly,
and to reconstruct the parent surface uniquely. This is the price paid for
exploring surface roughness with a light probe whose wavelength is much greater
than the vertical scale of the roughness. Although the power spectrum does not
tell us everything about the surface roughness, it does tell us all that we need
to know to predict light scattering, and, therefore, it recommends itself as a
natural quantity for characterizing the residual roughness of optical surfaces.

Now that we have defined forms of W corresponding to two types of optical
surfaces of practical interest, the next step is to substitute these results
into the previous expression for the scattered intensity and to evaluate the net
result in cases of interest.

4. ILLUSTRATIVE RESULTS. Figure 7 illustrates typical results for a random
surface with an exponential autocorrelation function. The rms roughness is taken
to be 50 Å and the correlation length, 20 µm. These are typical values for
polished metal mirrors.* The differential scattering intensity is plotted
versus the scattering angle in degrees. There are two curves - one for the HeNe
laser wavelength of 0.6328 µm, and one for the CO2 laser wavelength of 10.6 µm.
In this case of normal incidence the HeNe cross section peaks at zero scattering
angle, and falls off essentially as $1/\theta^3$. The CO2 cross section also
peaks at 0° but is generally much smaller in magnitude because of its smaller
wavenumber, k.
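
The qualitative behavior described for Figure 7 can be sketched numerically. The code below is an illustration only: it assumes the proportionality stated in Section 2 (intensity proportional to k⁴, the factor 1 + cos²θ, and W evaluated at k sin θ) together with a Lorentzian-type W of the form [1 + (κℓ)²]^(-3/2) for an exponential autocorrelation function. Constant prefactors, including σ², are omitted, so only ratios are meaningful; the function name is hypothetical.

```python
import numpy as np

def relative_scatter(theta_deg, wavelength_um, corr_len_um=20.0):
    """Unnormalized dI/dOmega ~ k^4 (1 + cos^2 theta) W(k sin theta).
    W is the assumed spectrum of an exponential autocorrelation function;
    constant prefactors (including sigma^2) are left out."""
    theta = np.radians(theta_deg)
    k = 2.0 * np.pi / wavelength_um            # wavenumber in 1/um
    kappa = k * np.sin(theta)                  # transverse wavenumber
    w = (1.0 + (kappa * corr_len_um) ** 2) ** -1.5
    return k**4 * (1.0 + np.cos(theta) ** 2) * w

angles = np.array([2.0, 4.0, 10.0])            # scattering angles in degrees
hene = relative_scatter(angles, 0.6328)
co2 = relative_scatter(angles, 10.6)

# Doubling the angle should reduce the HeNe intensity by nearly 2^3 = 8.
falloff_ratio = hene[0] / hene[1]
```

With these assumptions, doubling the angle from 2° to 4° lowers the HeNe value by a factor close to 8, which is the inverse-cube behavior described above, and the CO2 values stay well below the HeNe values at each of the angles sampled.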

Figure 8 illustrates the corresponding results for a corrugated surface - the
particular periodic surface shown in the upper corner. The half height is taken
to be 87 Å to give the same TIS as the surface in the preceding slide; that is,

7 

about 1%. The period is 5 µm, which is typical for micromachined surfaces.

The scattering here is in the form of a series of delta functions, each
corresponding to a harmonic of the roughness: n = 1, 3, 5, 7. Only odd harmonics
appear because of the vertical symmetry of the corrugations, and only the first
four of these appear in scattering because n = 9 and higher have wavelengths
shorter than the HeNe laser, and do not diffract. Only HeNe results are given
in this Figure since the CO2 wavelength is 10.6 µm, which is already larger than
the 5 µm fundamental, so that there is no diffraction at that wavelength and the
surface appears perfectly smooth.
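
The count of visible harmonics follows directly from the normal-incidence grating equation. As a minimal sketch, assuming the nth harmonic of a surface with period d diffracts at sin θ = nλ/d, the following lists which odd harmonics propagate for the 5 µm period at each laser wavelength. The function name is illustrative.

```python
import math

def propagating_harmonics(period_um, wavelength_um, harmonics):
    """Return {n: angle in degrees} for harmonics that diffract at normal
    incidence, using sin(theta_n) = n * wavelength / period."""
    angles = {}
    for n in harmonics:
        s = n * wavelength_um / period_um
        if s <= 1.0:                      # otherwise the order does not propagate
            angles[n] = math.degrees(math.asin(s))
    return angles

odd = range(1, 12, 2)                     # 1, 3, 5, 7, 9, 11
hene_orders = propagating_harmonics(5.0, 0.6328, odd)
co2_orders = propagating_harmonics(5.0, 10.6, odd)
```

For the HeNe wavelength only n = 1, 3, 5, 7 satisfy the condition, while for the CO2 wavelength no harmonic does, in agreement with the text.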

The intensity of the diffracted peaks falls off rapidly with increasing
angle. This is not due to an inherent inefficiency of the diffraction process
at large angles, but because the Fourier coefficients of the particular shape we
have chosen fall rapidly with n. In particular, they go as $1/n^2$, so that the
diffraction intensity shown here falls off as $1/n^4$.


The results shown in Figures 7 and 8 have a double value: they can be used as
a basis for designing specific experiments, and they illustrate the types of data
that we hope to extract about real surfaces. To repeat, differential light
scattering gives us information about the two-dimensional power spectrum of the
residual roughness. These power spectra may be more complicated for real surfaces.
However, for ease in comparison they are usually described by two length
parameters - $\sigma$ and $\ell$ in Figure 7, or h and d in Figure 8 - which
represent the vertical and horizontal structure of the roughness.
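
The relation between the half height h of Figure 8 and the rms roughness σ of Figure 7 can be illustrated under an assumption about the profile shape, which the text does not state. The sketch below assumes a symmetric triangular wave, one shape that has only odd harmonics with coefficients falling as 1/n²; the function name and the choice of profile are hypothetical.

```python
import math

def triangle_coefficient(n, half_height):
    """Magnitude of the n-th Fourier coefficient of a symmetric triangular
    wave with the given half height (peak amplitude); zero for even n."""
    if n % 2 == 0:
        return 0.0
    return 8.0 * half_height / (math.pi**2 * n**2)

h = 87.0                                   # half height in angstroms
rms_exact = h / math.sqrt(3.0)             # rms of a triangular wave

# Parseval: the mean square is the sum of (coefficient^2) / 2 over harmonics.
rms_from_series = math.sqrt(
    sum(triangle_coefficient(n, h) ** 2 / 2.0 for n in range(1, 2001))
)

# Diffracted intensity of each harmonic relative to the fundamental (1 / n^4).
relative_intensity = {
    n: (triangle_coefficient(n, h) / triangle_coefficient(1, h)) ** 2
    for n in (1, 3, 5, 7)
}
```

Under this assumed shape an 87 Å half height corresponds to an rms roughness of about 50 Å, matching the random surface of Figure 7, which is consistent with the two surfaces having the same TIS.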


As a final result, we consider the nominal ranges of these two length
parameters that are spanned by the light-scattering experiments. These are shown
in Figure 9. The shaded area is for the HeNe-wavelength laser and the dashed area
for the CO2. The squares represent - in effect - the windows through which light
scattering allows us to view the surface roughness parameters; a kind of transfer
function for the scattering process. The dot and the cross represent the two
particular examples we considered before: the isotropically rough and periodic
surfaces, respectively. As shown, they fall nicely in the HeNe window.


The limits of $\sigma$ are determined from the intensity of the scattering; here,
somewhat arbitrarily, by taking TIS = $10^{-4}$ and $10^{0}$. The limits on $\ell$
are determined by angular considerations. A maximum scattering angle of 90°
determines the minimum value of $\ell = \lambda$. A minimum angle of 10
milliradians, or ~ ½°, determines the maximum of $\ell = 100\lambda$. This
minimum angle of ~ ½° is typical detector resolution for the experiments we have
in mind. If we had a detector with infinitely good resolution, the upper limit
for $\ell$ would be limited by the diameter of the probing beam spot.
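
The window limits can be worked out explicitly. The sketch below rests on several assumptions: the TIS expression (4πσ/λ)² is the standard form and is not quoted in the text, the TIS limits are read as 10⁻⁴ and 10⁰, and the length limits come from the normal-incidence grating relation sin θ = λ/ℓ. All names are illustrative, and lengths are in micrometers.

```python
import math

def roughness_window(wavelength_um, tis_min=1e-4, tis_max=1.0,
                     theta_min_rad=0.010, theta_max_rad=math.pi / 2):
    """Nominal (sigma, length) window probed at one wavelength.
    sigma limits invert TIS = (4*pi*sigma/lambda)^2; length limits invert
    sin(theta) = lambda / length at the extreme scattering angles."""
    sigma_lo = math.sqrt(tis_min) * wavelength_um / (4 * math.pi)
    sigma_hi = math.sqrt(tis_max) * wavelength_um / (4 * math.pi)
    length_lo = wavelength_um / math.sin(theta_max_rad)
    length_hi = wavelength_um / math.sin(theta_min_rad)
    return (sigma_lo, sigma_hi), (length_lo, length_hi)

def in_window(sigma_um, length_um, window):
    """True if the (sigma, length) point lies inside the window."""
    (s_lo, s_hi), (l_lo, l_hi) = window
    return s_lo <= sigma_um <= s_hi and l_lo <= length_um <= l_hi

hene = roughness_window(0.6328)
co2 = roughness_window(10.6)

random_surface = (50e-4, 20.0)     # sigma = 50 A, correlation length 20 um
periodic_surface = (87e-4, 5.0)    # half height 87 A, period 5 um
```

With these assumptions the HeNe window spans roughly 5 Å to 500 Å vertically and 0.63 µm to 63 µm horizontally, so both example surfaces fall inside it, whereas neither falls inside the corresponding CO2 window.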


5. CONCLUSION. We are now setting up an experimental light-scattering
facility at Frankford Arsenal based on the principles and results described above.
This facility will be used to measure the scattering from a variety of real
optical surfaces - metal mirrors and the metallized surfaces of transmissive
optics - obtained from the Frankford Arsenal Optical Shops, industry, and other
government laboratories. We plan to use the data generated by these experiments
as a base for developing specific test and evaluation devices to satisfy the
Army needs.

1. Introduction

In recent years statisticians have become increasingly concerned with the
meaningful formulation and solution of certain multiple-decision problems which
arise in experimentation. Thus, for example, when an experimenter conducts tests
to compare the performances of several competing categories of items, his
ultimate objective often is to select the category (or categories) which is (are)
best, goodness being measured in terms of a particular parameter (e.g., the
population mean or the population variance) associated with the random variable
being observed. To accomplish this the experimenter requires a statistical
decision procedure which will tell him how many observations to take, how to
take these observations, and based on these observations which population(s) to
choose; the decision procedure should have the property that the probability of
an incorrect selection (or more generally, the risk or expected loss) is
controlled at some specified level.

In response to the need for such decision procedures, research
statisticians have been studying various possible appropriate formulations of
these problems, and have developed a body of statistical methodology to cope with
them. The procedures have come to be referred to as ranking and selection
procedures. The purpose of this paper is to introduce the reader to these
procedures, to describe some of them and the philosophy underlying their use,
and to discuss their properties.

In Section 2 we will pose the normal means problem, and use it as a
vehicle for motivating some of the basic ideas. The two most commonly adopted
formulations of ranking and selection problems, namely the so-called
indifference-zone approach and the subset approach, will be described. The
attributes of single-stage, two-stage, and sequential procedures devised for
the normal means problem, under different assumptions concerning the population
variances, will be assessed. In Section 3 we sketch some analogous results for
the normal variances problem, and in Section 4 we mention results for parameters
of other distributions.


This  research  was  supported  in  part  by  the  U.S.  Army  Research  Office- 
Durham  under  Contract  DAHC04-73-C-0008  and  by  the  Office  of  Naval  Research 
under Contract N00014-67-A-0077-0020.

The number of research papers written on subjects in this field is now
vast; it is hoped that this brief introduction will stimulate the reader to
explore the literature, and to apply the procedures where appropriate.

2.  The  normal  means  problem 

A very important problem which arises frequently in applications is that
of selecting the normal population which has the largest population mean. Thus,
for example, the ordnance engineer might be conducting firing programs to
compare the ballistic performance of different types of projectiles (in which case
his objective might be to select that type which, on the average, travels the
greatest distance), or the medical research worker might be studying the response
of patients to different kinds of analgesic drugs (in which case his interest
might lie in selecting that drug which produces, on the average, the longest
period of time without pain), or the agronomist might be conducting field trials
with different varieties of grain (in which case his purpose might be to select
that variety which produces, on the average, the largest yield per acre). In all
of these cases large values of the means are deemed to be desirable; however, in
other cases small values of the means might be considered desirable. The
procedures that we will describe can, with minor modifications, handle these
latter cases as well.

In Sections 2.1 and 2.3.1 we shall state the statistical assumptions
which underlie the procedures that have been developed. Then we shall describe
several approaches to the selection problem.

2.1  Statistical  assumptions 

We shall assume that we have k sources $\Pi_i$ ($1 \le i \le k$) of normally
distributed data, the ith source having population mean $\mu_i$ and population
variance $\sigma_i^2$; population $\Pi_i$ ($1 \le i \le k$) should be thought of
as being associated with the ith category. The $\mu_i$ are assumed to be unknown.
Let $\mu_{[1]} \le \mu_{[2]} \le \cdots \le \mu_{[k]}$ denote the ranked values of
the $\mu_i$; it is assumed that the pairing of the $\Pi_i$ with the $\mu_{[j]}$
($1 \le i, j \le k$) is completely unknown. Possible assumptions concerning the
$\sigma_i^2$ ($1 \le i \le k$) will be discussed in Section 2.3.1. Throughout
this paper $X_{ij}$ ($1 \le i \le k$, $j = 1, 2, \ldots$) will denote the jth
observation from $\Pi_i$, all observations being assumed independent.

2.2 Some formulations

The two most commonly used formulations of the selection problem are
due to Bechhofer [1954] and Gupta [1956], [1965]; these are referred to as the
indifference-zone approach and the subset approach, respectively. The approaches
are described below.


2.2.1  The  indifference-zone  approach 

The goal and probability requirement associated with the
indifference-zone approach are:

Goal: "To select the population associated with $\mu_{[k]}$."              (2.1)

It is assumed that prior to the start of experimentation the experimenter can
specify two constants $\{\delta^*, P^*\}$ ($0 < \delta^* < \infty$,
$1/k < P^* < 1$) which are then incorporated into the following probability
requirement:

Probability requirement:

Prob{Selecting the population associated with $\mu_{[k]}$} $\ge P^*$
whenever $\mu_{[k]} - \mu_{[k-1]} \ge \delta^*$.                           (2.2)

The experimenter then restricts consideration to procedures which guarantee (2.2).
(In (2.2) the specified quantity $\delta^*$ can be thought of as the smallest
difference "worth detecting" between the population mean of the "best" and
"second best" population; $P^*$ is specified strictly greater than 1/k since a
probability of 1/k can be achieved by choosing one of the k populations at
random.)
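
What requirement (2.2) asks of a procedure can be illustrated by simulation. The sketch below assumes a common known variance and the natural rule of selecting the population with the largest sample mean, which is the rule used by the single-stage procedure of Bechhofer [1954]. The means are placed in the configuration where the best mean exceeds all others by exactly $\delta^*$, the least favorable case for this rule. The function name, sample sizes, and parameter values are hypothetical.

```python
import numpy as np

def estimate_pcs(k, n, delta, sigma=1.0, trials=20000, seed=0):
    """Monte Carlo estimate of the probability of correct selection when the
    population with the largest sample mean is selected. Means are set to
    mu_[1] = ... = mu_[k-1] = 0 and mu_[k] = delta."""
    rng = np.random.default_rng(seed)
    mu = np.zeros(k)
    mu[-1] = delta                                 # the "best" population
    # Sample means of n observations are distributed N(mu_i, sigma^2 / n).
    means = rng.normal(mu, sigma / np.sqrt(n), size=(trials, k))
    return float(np.mean(np.argmax(means, axis=1) == k - 1))

pcs_small = estimate_pcs(k=3, n=1, delta=1.0)      # one observation each
pcs_large = estimate_pcs(k=3, n=10, delta=1.0)     # ten observations each
```

The estimated probability of correct selection always exceeds 1/k and rises with the number of observations per population; a procedure guarantees (2.2) when the sample size is large enough that this probability reaches the specified $P^*$.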


2.2.2  The  subset  approach 

The goal and probability requirement associated with the subset
approach are:

Goal: "To select a (non-empty) subset of the populations
which contains the population associated with $\mu_{[k]}$."                (2.3)

It is assumed that prior to the start of experimentation the experimenter can
specify a constant $\{P^*\}$ ($1/k < P^* < 1$) which is then incorporated into
the following probability requirement:

Probability requirement:

Prob{Selected subset contains the population associated with $\mu_{[k]}$} $\ge P^*$
regardless of the values of the $\mu_i$ ($1 \le i \le k$).                 (2.4)

The experimenter then restricts consideration to procedures which guarantee (2.4).

Remark 1: It is to be noted that the experimenter plans his experiment assuming
that the population means are not all equal; this is a very reasonable assumption
in almost all real-life situations. He is interested in identifying the "best"
population - in this case the population with the largest population mean. Goal
(2.1) leads to a k-decision problem since the experimenter must choose one of the
k populations based on the outcome of his experiment (i.e., his possible
decisions are: $\Pi_1$ is best, or $\Pi_2$ is best, or ..., or $\Pi_k$ is best).
Similarly, goal (2.3) leads to a $(2^k - 1)$-decision problem since the
experimenter must choose one of the $2^k - 1$ non-empty subsets of the k
populations based on the outcome of his experiment (e.g., for k = 3 his possible
decisions are: only $\Pi_1$ is in the subset, only $\Pi_2$ is in the subset, only
$\Pi_3$ is in the subset, $\Pi_1$ and $\Pi_2$ are in the subset, $\Pi_1$ and
$\Pi_3$ are in the subset, $\Pi_2$ and $\Pi_3$ are in the subset, $\Pi_1$ and
$\Pi_2$ and $\Pi_3$ are in the subset). These multi-decision approaches
are in marked contrast to the classical 2-decision test-of-homogeneity approach
afforded by the Analysis of Variance; in that approach the experimenter tests the
(usually completely unrealistic) hypothesis that the k population means are
equal, and decides based on the outcome of the experiment either to accept the
hypothesis or to reject the hypothesis.


Remark 2: As noted above, goal (2.1) leads to a k-decision problem. However,
depending on the practical situation under consideration, the experimenter can,
using the indifference-zone approach, pose more general goals. For example, he
may wish to select the t ($1 \le t \le k-1$) best populations with regard to
order, or he may wish to select the t ($1 \le t \le k-1$) best populations
without regard to order, t being fixed before the start of experimentation.
(Both goals reduce to (2.1) when t = 1.) These more general goals lead to a
[k!/(k-t)!]-decision problem and a [k!/t!(k-t)!]-decision problem, respectively.
Such general goals and others are discussed in Bechhofer [1954] and Mahamunulu
[1967].
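
The decision counts quoted in Remarks 1 and 2 can be tabulated directly. The following sketch uses Python's standard library; the function and key names are illustrative.

```python
from itertools import combinations
from math import comb, perm

def count_decisions(k, t=1):
    """Number of possible terminal decisions for each goal discussed above."""
    return {
        "select_single_best": k,            # goal (2.1)
        "select_subset": 2**k - 1,          # goal (2.3)
        "t_best_ordered": perm(k, t),       # k! / (k - t)!
        "t_best_unordered": comb(k, t),     # k! / (t! (k - t)!)
    }

def nonempty_subsets(k):
    """Enumerate the non-empty subsets of populations 1..k."""
    labels = range(1, k + 1)
    return [s for r in range(1, k + 1) for s in combinations(labels, r)]

counts = count_decisions(k=3, t=2)
subsets = nonempty_subsets(3)
```

For k = 3 the enumeration reproduces the seven subsets listed in Remark 1, and for t = 1 both of the more general counts reduce to k, as stated in Remark 2.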


Remark 3: For goal (2.1) and the indifference-zone approach, the experimenter
always ends up by selecting a single population. For goal (2.3) and the subset
approach, the experimenter ends up by selecting 1 or 2 or ... or k
populations, depending on the outcome of the experiment; thus for this latter
approach the number of populations in the selected subset is a random variable.


2.2.3  Other  approaches 

Santner [1975] has proposed a restricted subset approach in which
the experimenter selects 1 or 2 or ... or c populations, depending on
the outcome of the experiment, where c ($1 \le c \le k$) is decided on and fixed
before the start of experimentation; his approach can be regarded as bridging the
indifference-zone and subset approaches since if c = 1 his approach reduces to
the indifference-zone approach while if c = k it reduces to the subset approach.
Other approaches in which more general "loss functions" are used have been
proposed by Somerville [1954] and Fairweather [1968]. An approach in which the
$\mu_i$ are assumed to have prior distributions has been considered by Dunnett
[1960] while a similar idea from a Bayesian point of view has been proposed by
Raiffa and Schlaifer [1961] and Deely and Gupta [1968]. However, for brevity we
will not discuss these or other approaches.

2.3 Assumptions concerning the variances


2.3.1  Possible  assumptions 

In order to devise procedures which will guarantee (2.2) or
(2.4) for the normal means problem, it is necessary to make an assumption
concerning the values of the $\sigma_i^2$ ($1 \le i \le k$). Which assumption it
is appropriate for the experimenter to make in any particular practical situation
depends on the information available to him at the time that he plans his
experiment. The four most common assumptions are that:

a) The values of the $\sigma_i^2$ ($1 \le i \le k$) are known, and all are
   equal to $\sigma^2$ (say).                                              (2.5a)

b) The values of the $\sigma_i^2$ ($1 \le i \le k$) are known, but not all
   are equal.                                                              (2.5b)

c) The values of the $\sigma_i^2$ ($1 \le i \le k$) are unknown, but it is
   known that they have a common value $\sigma^2$ (say).                   (2.5c)

d) The values of the $\sigma_i^2$ ($1 \le i \le k$) are completely unknown. (2.5d)

2.3.2 The variance assumption and associated procedures


Once  the  experimenter  has  adopted  one  of  these  assumptions  he  then 
must  choose  a selection  procedure  which  was  derived  under  that  particular  assump- 
tion. 

Thus, for example, if he wishes to guarantee (2.2) and adopts
assumption (2.5a) or (2.5b), then he can use a single-stage procedure (Bechhofer
[1954]), a two-stage procedure (Alam [1970] or Tamhane [1975]), an open
sequential procedure without elimination (Bechhofer, Kiefer, Sobel [1968]), or a
closed sequential procedure with elimination (Paulson [1964]). If he wishes to
guarantee (2.2) and adopts assumption (2.5c), then he cannot use a single-stage
procedure (see Dudewicz [1971]) although he can use a two-stage procedure
(Bechhofer, Dunnett, Sobel [1954]) or a sequential procedure (Paulson [1964]);
similarly, if he wishes to guarantee (2.2) and adopts assumption (2.5d), then he
cannot use a single-stage procedure although he can use a two-stage procedure
(Dudewicz and Dalal [1971] or Rinott [1974]). Finally, if the experimenter wishes
to guarantee (2.4), and he adopts assumption (2.5a) or (2.5c), then he can use a
single-stage procedure (Gupta [1956], [1965]).

When  the  experimenter  has  adopted  a particular  assumption  and  as  a 
consequence  has  the  option  of  choosing  among  several  competing  procedures,  each 
one  of  which  will  guarantee  his  probability  requirement,  he  then  chooses  one  of 
these  procedures  on  the  basis  of  various  possible  operational  or  cost  criteria. 

An  indication  of  such  criteria  will  be  given  in  our  later  discussion.  In  the 
next  section  we  shall  describe  certain  selection  procedures.  Our  emphasis  will 
be on procedures which can be used with the indifference-zone approach to
guarantee  (2.2). 

2.4  Procedures  for  use  with  the  indifference-zone  approach  under  the 
assumption  of  common  known  variance 

In this section we shall describe three procedures, each one of which
will  guarantee  (2.2)  when  assumption  (2.5a)  is  made;  minor  modifications  of  these 
procedures  will  guarantee  (2.2)  when  assumption  (2.5b)  is  made.  The  procedures 
will  be  introduced  in  the  order  of  their  historical  development,  each  being 
designed  to  afford  different  options  to  the  experimenter. 


2.4.1 Single-stage procedure

The easiest type of procedure to implement is a single-stage one. The
following single-stage procedure was proposed by Bechhofer [1954]; constants
$c_{k,P^*}$ (see a), below) necessary to implement this procedure are given in
Table I.

"a) Take a common number N of observations from each of the k
populations where N is the smallest integer greater than or
equal to $(c_{k,P^*}\,\sigma/\delta^*)^2$.

b) Calculate $\bar{X}_i = \sum_{j=1}^{N} X_{ij}/N$ $(1 \le i \le k)$, and let
$\bar{X}_{[1]} < \bar{X}_{[2]} < \cdots < \bar{X}_{[k]}$ denote the ranked
values of the $\bar{X}_i$.                                        (2.6)

c) Select the population which yielded $\bar{X}_{[k]}$ as the one associated
with $\mu_{[k]}$."

Note: The constants $c_{k,P^*}$ are computed under the assumption that the
$\mu_i$ are in the so-called least-favorable (LF) configuration, i.e.,

$\mu_{[1]} = \cdots = \mu_{[k-1]} = \mu_{[k]} - \delta^*$.


Table I
Values of $c_{k,P^*}$

                              k
  P*       2        3        4        5        7        10
  0.99   3.2900   3.6173   3.7970   3.9196   4.0861   4.2456
  0.95   2.3262   2.7101   2.9162   3.0552   3.2417   3.4182
  0.90   1.8124   2.2302   2.4516   2.5997   2.7972   2.9829
  0.80   1.1902   1.6524   1.8932   2.0528   2.2639   2.4608
  0.60   0.3583   0.8852   1.1532   1.3287   1.5583   1.7700

The values in this table are abstracted from Table I of Bechhofer
[1954] where values for other k and P* are also given. Additional
values for k = n+1 = 2(1)51 and P* = 0.99, 0.975, 0.95,
0.90, 0.75 are contained in Table I of Gupta [1963]; Gupta's values
must be multiplied by $\sqrt{2}$ in order to obtain the $c_{k,P^*}$-values
required in (2.6).
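
As an illustration of how the constants and the sample size of procedure (2.6) fit together, the following Python sketch may be helpful. It rests on an assumption that is not printed in the text above: that in the LF-configuration the probability of a correct selection equals the integral of $\Phi^{k-1}(z + c)\,\phi(z)$ over $z$, with $c = \sqrt{N}\,\delta^*/\sigma$, which is the usual normal-theory expression. The function names are hypothetical, and the numerical integration is deliberately crude.

```python
import math


def std_normal_cdf(x):
    """Standard normal distribution function Phi(x)."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def std_normal_pdf(x):
    """Standard normal density phi(x)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def lf_prob_correct_selection(c, k, lo=-8.0, hi=8.0, steps=1600):
    """Assumed LF-configuration probability of correct selection.

    Integrates Phi(z + c)^(k-1) * phi(z) over [lo, hi] by Simpson's rule,
    where c plays the role of sqrt(N) * delta_star / sigma.
    """
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        z = lo + i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * std_normal_cdf(z + c) ** (k - 1) * std_normal_pdf(z)
    return total * h / 3.0


def solve_c(k, p_star, tol=1e-7):
    """Bisection for the constant c with LF probability equal to p_star.

    Requires p_star > 1/k, since c = 0 gives probability 1/k.
    """
    lo, hi = 0.0, 10.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if lf_prob_correct_selection(mid, k) < p_star:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def single_stage_n(c, sigma, delta_star):
    """Step a) of (2.6): smallest integer >= (c * sigma / delta_star)^2."""
    return math.ceil((c * sigma / delta_star) ** 2)


if __name__ == "__main__":
    c = solve_c(3, 0.95)
    print(round(c, 4), single_stage_n(c, sigma=1.0, delta_star=0.5))
```

Under this assumption the sketch should reproduce the entries of Table I to the printed accuracy; for instance, with k = 3, P* = 0.95, $\sigma$ = 1 and $\delta^*$ = 0.5 the tabled constant 2.7101 gives N = 30.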
2.4.2 Open-ended sequential procedure without elimination

The single-stage procedure of Section 2.4.1 is conservative in the
sense that the constants $c_{k,P^*}$ necessary to implement it are computed under
the assumption that the population means are in the LF-configuration; however,
it has been shown (Hall [1959]) that the probability requirement (2.2) cannot
be guaranteed with a smaller N if the experimenter restricts consideration
to single-stage procedures. If this restriction is eliminated, and multi-stage
procedures are permitted, then certain gains can be achieved. What is desired
is a multi-stage procedure which not only will guarantee the probability
requirement (2.2) when the population means are in the LF-configuration, but also
will require a smaller number of observations per population, on the average,
than the N of (2.6) when the population means are in very favorable
configurations, in particular when $(\mu_{[k]} - \mu_{[k-1]})/\sigma$ is large. The
following sequential procedure, which possesses these attributes, was proposed by
Bechhofer, Kiefer, and Sobel [1968], pp. 258-9, 264-7.

"a) Take one observation from each of the k populations at
each stage of experimentation. Let $\sum_{j=1}^{m} X_{ij}$ denote the
cumulative sum from $\Pi_i$ at the mth stage of experimentation, and let
$\sum_{j=1}^{m} X_{[1]j} < \sum_{j=1}^{m} X_{[2]j} < \cdots < \sum_{j=1}^{m} X_{[k]j}$
denote the ranked values of the $\sum_{j=1}^{m} X_{ij}$.

b) At the mth stage of experimentation (m = 1, 2, ...) compute

$Z_m = \sum_{i=1}^{k-1} \exp\Big\{ -\frac{\delta^*}{\sigma^2} \Big( \sum_{j=1}^{m} X_{[k]j} - \sum_{j=1}^{m} X_{[i]j} \Big) \Big\}$.     (2.7)

Then proceed as follows:

i) If $Z_m \le (1-P^*)/P^*$, stop experimentation and select the
population which yielded $\sum_{j=1}^{m} X_{[k]j}$ as the one associated
with $\mu_{[k]}$.

ii) If $Z_m > (1-P^*)/P^*$, take another observation from each of
the k populations and compute $Z_{m+1}$.

Continue in this manner until the rule calls for stopping."
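
The stopping rule of (2.7) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `draw` is a hypothetical callback that returns one new observation from population i, and `max_stages` is a safety cap added for the sketch, not part of the procedure.

```python
import math


def bks_statistic(cum_sums, delta_star, sigma):
    """Z_m of (2.7): sum over the k-1 non-leading populations of
    exp(-(delta_star / sigma^2) * (largest cumulative sum - that sum))."""
    ranked = sorted(cum_sums)
    top = ranked[-1]
    return sum(math.exp(-delta_star * (top - s) / sigma ** 2)
               for s in ranked[:-1])


def bks_sequential(draw, k, delta_star, sigma, p_star, max_stages=100000):
    """Run the open-ended rule; return (selected population index, stages).

    draw(i) is a hypothetical source of one observation from population i.
    """
    cum_sums = [0.0] * k
    bound = (1.0 - p_star) / p_star
    for m in range(1, max_stages + 1):
        # One vector of observations (one per population) per stage.
        for i in range(k):
            cum_sums[i] += draw(i)
        if bks_statistic(cum_sums, delta_star, sigma) <= bound:
            best = max(range(k), key=lambda i: cum_sums[i])
            return best, m
    raise RuntimeError("no decision reached within max_stages (sketch cap)")


if __name__ == "__main__":
    import random
    rng = random.Random(1)
    means = [0.0, 0.0, 1.0]
    print(bks_sequential(lambda i: rng.gauss(means[i], 1.0),
                         k=3, delta_star=1.0, sigma=1.0, p_star=0.95))
```

With noise-free observations 0, 0, 1 and $\delta^* = \sigma = 1$, P* = 0.95, the statistic is $Z_m = 2e^{-m}$, which first falls below $(1-P^*)/P^* \approx 0.0526$ at m = 4.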

Remark 4: For (2.7) the observations are taken in vectors, each vector
constituting a stage, there being one observation from each population in every
vector. The number of stages (i.e., number of observations per population)
necessary to terminate experimentation is a random variable. The expected number
of stages to terminate experimentation has been shown (B-K-S [1968], Tables
12.8.2 and 12.8.3) to be less than N for many configurations of the $\mu_i$
$(1 \le i \le k)$; in particular, if $(\mu_{[k]} - \mu_{[k-1]})/\sigma$ is large,
then with high probability experimentation will cease after only a small number
of stages. Regardless of the configuration of the $\mu_i$ $(1 \le i \le k)$
experimentation will cease with probability one after a finite number of stages.


2.4.3  Closed  sequential  procedure  with  elimination 

The sequential procedure of Section 2.4.2 has two possible drawbacks:
i) It is open-ended, i.e., before the start of experimentation it is not possible
to give a finite upper bound on the number of stages to terminate experimentation,
and ii) It does not eliminate "non-contending" populations, i.e., it continues to
sample from populations which, based on observations obtained in the early stages
of experimentation, would appear to be out of contention for being selected as
"best." The following sequential procedure, which overcomes these drawbacks
of (2.7), was proposed by Paulson [1964]; like (2.7) it guarantees the probability
requirement (2.2) when the population means are in the LF-configuration, and
also tends to cease experimentation early when the population means are in very
favorable configurations:

For fixed $\lambda$ $(0 < \lambda \le \delta^*/2)$ let
$a_\lambda = [\sigma^2/(\delta^*-\lambda)]\,\log[(k-1)/(1-P^*)]$, and
let $W_\lambda$ = the largest integer less than $a_\lambda/\lambda$. Paulson's
procedure is actually a family of procedures which depend on the choice of
$\lambda$; in Remark 9, below, we shall make some comments on the role of
$\lambda$.

"Take one observation from each of the k populations at the
first stage of experimentation. Eliminate from further consideration
any population $\Pi_i$ for which
$a_\lambda - \lambda < \max_{1 \le s \le k} X_{s1} - X_{i1}$. If
all but one population is eliminated after the first stage, stop
experimentation and select the remaining population as the one
associated with $\mu_{[k]}$. Otherwise, go on to the second stage and take
one observation from each population not yet eliminated. At stage
m $(2 \le m \le W_\lambda)$ take one observation from each population not
eliminated after the (m-1)st stage, and then eliminate from further
consideration any remaining population $\Pi_i$ for which

$a_\lambda - m\lambda < \max_s \Big\{ \sum_{j=1}^{m} X_{sj} \Big\} - \sum_{j=1}^{m} X_{ij}$,     (2.8)

where the sums are only for populations left after the (m-1)st
stage. If all but one population is eliminated after the mth
stage, stop experimentation and select the remaining population
as the one associated with $\mu_{[k]}$; otherwise go on to the (m+1)st
stage. If more than one population remains after stage $W_\lambda$,
terminate experimentation at the $(W_\lambda+1)$st stage by selecting
the remaining population with the largest sum of the $(W_\lambda+1)$
observations as the one associated with $\mu_{[k]}$."
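
A compact way to see how elimination and the closed stopping bound interact in (2.8) is the following Python sketch. It is an illustration under stated assumptions rather than a reference implementation: `draw` is a hypothetical callback returning one observation from population i, and ties at the terminal stage are broken in favor of the lowest index.

```python
import math


def paulson(draw, k, delta_star, sigma, p_star, lam):
    """Sketch of procedure (2.8); returns (selected index, stages used).

    draw(i) is a hypothetical source of one observation from population i.
    """
    a = sigma ** 2 / (delta_star - lam) * math.log((k - 1) / (1.0 - p_star))
    w = math.ceil(a / lam) - 1          # largest integer strictly less than a/lam
    alive = list(range(k))              # populations not yet eliminated
    sums = [0.0] * k                    # cumulative sums of observations
    for m in range(1, w + 1):
        for i in alive:
            sums[i] += draw(i)
        top = max(sums[i] for i in alive)
        # Eliminate population i when a - m*lam < top - sums[i].
        alive = [i for i in alive if a - m * lam >= top - sums[i]]
        if len(alive) == 1:
            return alive[0], m
    # Terminal stage W+1: one more observation each, then pick the largest sum.
    for i in alive:
        sums[i] += draw(i)
    best = max(alive, key=lambda i: sums[i])
    return best, w + 1


if __name__ == "__main__":
    import random
    rng = random.Random(7)
    means = [0.0, 0.2, 1.0]
    print(paulson(lambda i: rng.gauss(means[i], 1.0),
                  k=3, delta_star=1.0, sigma=1.0, p_star=0.95, lam=0.5))
```

For k = 3, $\delta^* = \sigma = 1$, P* = 0.95 and $\lambda = 0.5$ the sketch gives $a_\lambda \approx 7.38$ and $W_\lambda = 14$, so experimentation can never run past 15 stages.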


Remark 5: The procedure (2.8) never requires more than $W_\lambda+1$ stages to
terminate experimentation.

Remark 6: The procedure (2.8) permanently eliminates apparently non-contending
populations; thus the number of observations taken at the mth stage of
experimentation is less than or equal to the number of observations taken at the
(m-1)st stage of experimentation.

Remark 7: The cost of experimentation using procedures (2.7) and (2.8) can be
measured in terms of expected number of stages to terminate experimentation
and/or expected total number of observations to terminate experimentation.


Remark 8:

$\max_{\mu_1,\mu_2,\ldots,\mu_k}$ E{Number of stages to terminate experimentation}

and

$\max_{\mu_1,\mu_2,\ldots,\mu_k}$ E{Total number of observations to terminate experimentation}

are less for (2.8) than for (2.7) when P* is high (i.e., close to unity) but
the inequality is reversed if P* is sufficiently small; Perng [1968] has
studied that question analytically. This result is of practical interest
since it compares the performance of (2.7) and (2.8) when
$\mu_{[1]} = \mu_{[k]}$, i.e., when, unknown to the experimenter, all of the
population means are equal and thus the expected number of stages and the
expected total number of observations are at their maxima.


Remark 9: Fabian [1974] pointed out the advantage of choosing
$\lambda = \delta^*/2$, and recommended for that choice of $\lambda$ that
$1-P^*$ in $a_\lambda$ be replaced by $2(1-P^*)$, yielding
$[2\sigma^2/\delta^*]\,\log[(k-1)/(2(1-P^*))] = b$ (say); then b replaces
$a_\lambda$ and $\delta^*/2$ replaces $\lambda$ in (2.8). This modified procedure
still guarantees the probability requirement (2.2) when the population means are
in the LF-configuration. It uniformly (in the $\mu_i$) reduces the expected number
of stages and expected total number of observations relative to the ones that
would have been obtained with the unmodified procedure employing
$\lambda = \delta^*/2$; in addition, in either the family of unmodified Paulson
procedures or in the family of modified Paulson procedures the choice
$\lambda = \delta^*/2$ has the property that
$\max_{\mu_1,\mu_2,\ldots,\mu_k}$ E{Total number of observations to terminate
experimentation} is approximately minimized for P* close to unity.
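
The effect of Fabian's modification on the elimination boundary can be checked numerically. The short Python sketch below (function names are hypothetical) computes $a_\lambda$ at $\lambda = \delta^*/2$ and the modified constant b from the formulas quoted in Remark 9; their difference is $(2\sigma^2/\delta^*)\log 2$, which follows directly from the two logarithms.

```python
import math


def paulson_constant(k, delta_star, sigma, p_star, lam):
    """a_lambda of (2.8): [sigma^2 / (delta* - lambda)] * log[(k-1)/(1-P*)]."""
    return sigma ** 2 / (delta_star - lam) * math.log((k - 1) / (1.0 - p_star))


def fabian_constant(k, delta_star, sigma, p_star):
    """b of Remark 9: [2 sigma^2 / delta*] * log[(k-1) / (2(1-P*))]."""
    return (2.0 * sigma ** 2 / delta_star
            * math.log((k - 1) / (2.0 * (1.0 - p_star))))


if __name__ == "__main__":
    k, delta_star, sigma, p_star = 3, 1.0, 1.0, 0.95
    a = paulson_constant(k, delta_star, sigma, p_star, lam=delta_star / 2)
    b = fabian_constant(k, delta_star, sigma, p_star)
    # The boundary shrinks by (2 sigma^2 / delta*) * log 2.
    print(round(a, 4), round(b, 4), round(a - b, 4))
```

Because b is smaller than $a_\lambda$, populations are eliminated sooner and the bound on the number of stages is reduced accordingly.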


2.4.4  Two-stage  procedure 

The sequential procedures (2.7) and (2.8) have the drawback that
they may not be appropriate for use in certain types of experimentation. For
example, in agricultural experimentation where yields can be obtained only
once per year (or per growing season), and thus only one vector of observations
can be obtained per time period, multi-stage experimentation is impractical.
In such situations two-stage experimentation would appear to be appropriate.
Alam [1970] and Tamhane [1975] have developed two-stage procedures which guarantee
the probability requirement (2.2) when the population means are in the
LF-configuration; their procedures screen out the apparently non-contending
populations in the first stage, and concentrate sampling on the remaining
populations in the second (terminal) stage. Tamhane's procedure has the added
virtue of possessing a minimax property similar to that achieved by Fabian's
modification of (2.8) when $\lambda = \delta^*/2$.

2.5 Procedures for use with the indifference-zone approach under the
assumption of unknown variances


As  was  mentioned  in  Section  2.3.2,  if  the  experimenter  wishes  to 
guarantee  (2.2)  and  adopts  assumption  (2.5c)  or  (2.5d)  then  he  cannot  use  a 
single-stage procedure. In this section we shall consider two-stage procedures
which  accomplish  these  objectives. 


2.5.1  Two-stage  procedure  for  the  common  unknown  variance  case 


The following two-stage procedure for the common unknown variance
case was proposed by Bechhofer, Dunnett, and Sobel [1954]; constants
$h_{k,P^*,n}$ (see c), below) necessary to implement this procedure for
P* = 0.95 are given in Table II.

"a) In the first stage take an arbitrary common number $N_0 > 1$
of observations from each of the k populations.

b) Calculate
$S^2 = \sum_{i=1}^{k} \sum_{j=1}^{N_0} \big( X_{ij} - \sum_{j=1}^{N_0} X_{ij}/N_0 \big)^2 / n$,
which is an unbiased estimate of $\sigma^2$ based on $n = k(N_0-1)$ degrees of
freedom.

c) Enter the appropriate table (e.g., Table II, below, for
P* = 0.95) with $n = k(N_0-1)$ and the specified P*, and
obtain a constant $h_{k,P^*,n} = h$ (say).

d) In the second stage, take a common number $N - N_0$ of additional
observations from each of the k populations where

$N = N_0$ if $2(hS/\delta^*)^2 \le N_0$,

$N = [2(hS/\delta^*)^2]$ if $2(hS/\delta^*)^2 > N_0$,                    (2.9)

and [y] denotes the smallest integer equal to or greater than y.

e) Calculate the k over-all (first-stage plus second-stage) sample
sums $\sum_{j=1}^{N} X_{ij}$ and let
$\sum_{j=1}^{N} X_{[1]j} < \sum_{j=1}^{N} X_{[2]j} < \cdots < \sum_{j=1}^{N} X_{[k]j}$
denote the ranked values of the $\sum_{j=1}^{N} X_{ij}$.

f) Select the population which yielded $\sum_{j=1}^{N} X_{[k]j}$ as the one
associated with $\mu_{[k]}$."

Note: The constants $h_{k,P^*,n}$ are computed under the assumption that the
$\mu_i$ $(1 \le i \le k)$ are in the LF-configuration.
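
Steps b) and d) of (2.9) amount to a pooled variance estimate followed by a sample-size rule. The Python sketch below illustrates them under stated assumptions: the function names are hypothetical, the first-stage data are made up, and the constant h is taken by hand from Table II rather than computed.

```python
import math


def pooled_variance(first_stage):
    """Step b) of (2.9): pooled S^2 and its degrees of freedom n = k(N0 - 1).

    first_stage is a list of k samples, each holding N0 observations.
    """
    k = len(first_stage)
    n0 = len(first_stage[0])
    dof = k * (n0 - 1)
    ss = 0.0
    for sample in first_stage:
        mean = sum(sample) / n0
        ss += sum((x - mean) ** 2 for x in sample)
    return ss / dof, dof


def total_sample_size(first_stage, h, delta_star):
    """Step d) of (2.9): N = max(N0, smallest integer >= 2 (h S / delta*)^2)."""
    s2, _ = pooled_variance(first_stage)
    n0 = len(first_stage[0])
    return max(n0, math.ceil(2.0 * h * h * s2 / delta_star ** 2))


if __name__ == "__main__":
    # Illustrative data only: k = 2 populations, N0 = 4, hence n = 6.
    data = [[1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0]]
    s2, dof = pooled_variance(data)
    # h = 1.94 is the Table II entry for k = 2, n = 6, P* = 0.95.
    print(s2, dof, total_sample_size(data, h=1.94, delta_star=2.0))
```

For these illustrative data $S^2 = 25/6$ on n = 6 degrees of freedom, so $2(hS/\delta^*)^2 \approx 7.84$ and N = 8; four additional observations per population would be taken in the second stage.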


Table II
Values of $h_{k,P^*,n}$ for P* = 0.95

                          k
   n      2      3      4      5      7      10
   5     2.02   2.44   2.68   2.85   3.08   3.30
   6     1.94   2.34   2.56   2.71   2.92   3.12
   7     1.89   2.27   2.48   2.62   2.82   3.01
   8     1.86   2.22   2.42   2.55   2.74   2.92
   9     1.83   2.18   2.37   2.50   2.68   2.86
  10     1.81   2.15   2.34   2.47   2.64   2.81
  15     1.75   2.07   2.24   2.36   2.51   2.67
  20     1.72   2.03   2.19   2.30   2.46   2.60
  30     1.70   1.99   2.15   2.25   2.40   2.54
  60     1.67   1.95   2.10   2.21   2.35   2.48
   ∞     1.64   1.92   2.06   2.16   2.29   2.42



The values in this table are abstracted from Table 1a of
Dunnett [1955]; Table 1b of Dunnett [1955] gives corresponding
values for P* = 0.99; Dunnett's p equals our k-1.

Note: The value of h given for the n = ∞ row of Dunnett [1955], Table 1a
is the same as the value given by Gupta [1963], Table I, for the same
k-1 = p = n and P* = 0.95 = 1-α.


Remark 10: The total number of observations N required by the two-stage procedure
is a random variable since its value depends on the value of $S^2$; no additional
observations are taken in the second stage if S is sufficiently small.

Remark 11: Paulson [1964], Section 5, proposed an open-ended sequential procedure
which permanently eliminates non-contending populations; his procedure is
applicable in situations in which the common variance is unknown.

2.5.2  Two-stage  procedures  for  the  completely  unknown  variance  case 


Dudewicz and Dalal [1971], and also Rinott [1974], proposed two-stage
procedures for the completely unknown variance case. Like (2.9), the
common number of observations in the first stage for each of these procedures
is arbitrary (>1), while the total number of observations per population is
a random variable.


2.6 Procedure for use with the subset approach under the assumption of
common (known or unknown) variance


As was mentioned in Section 2.3.2, if the experimenter wishes to
guarantee (2.4) and adopts assumption (2.5a) or (2.5c), then he can use a
single-stage procedure. The following single-stage procedure was proposed by
Gupta [1956], [1965] for use under assumption (2.5c); constants $d_{k,P^*,n}$
(see c), below) necessary to implement this procedure are given in Table III.
(Under assumption (2.5a), the random variable S in d) of (2.10) is replaced by
$\sigma$, and the value of $d_{k,P^*,n}$ for n = ∞ is used.)

"a) Take a common arbitrary number N > 1 of observations from
each of the k populations.

b) Calculate $\bar{X}_i = \sum_{j=1}^{N} X_{ij}/N$ $(1 \le i \le k)$ and let
$\bar{X}_{[1]} < \bar{X}_{[2]} < \cdots < \bar{X}_{[k]}$ denote the ranked
values of the $\bar{X}_i$; also calculate
$S^2 = \sum_{i=1}^{k} \sum_{j=1}^{N} \big( X_{ij} - \sum_{j=1}^{N} X_{ij}/N \big)^2 / n$,
which is an unbiased estimate of $\sigma^2$ based on $n = k(N-1)$ degrees of
freedom.                                                          (2.10)

c) Enter the appropriate table (e.g., Table III, below, for
P* = 0.95) with $n = k(N-1)$ and the specified P*, and
obtain a constant $d_{k,P^*,n} = d$ (say).

d) Retain the population $\Pi_i$ $(1 \le i \le k)$ in the selected subset if
and only if $\bar{X}_i \ge \bar{X}_{[k]} - dS/\sqrt{N}$."

Table III
Values of $d_{k,P^*,n}$ for P* = 0.95

                 k
   n      2      5      10
  15     2.48   3.34   3.78
  20     2.44   3.25   3.67
  30     2.40   3.19   3.59
  60     2.36   3.12   3.50



The values in this table are abstracted from Table I of Gupta
and Sobel [1957] which gives many additional d-values for
P* = 0.75, 0.90, 0.95, 0.975, 0.99.

Note: $d_{k,P^*,n} = \sqrt{2}\, h_{k,P^*,n}$ where $h_{k,P^*,n}$ is given in Table II.
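
The retention rule in d) of (2.10) is easy to state in code. The Python sketch below is an illustration only: the function names are hypothetical, the data are made up, and the constant d is read by hand from Table III.

```python
import math


def pooled_std(samples):
    """S of (2.10): square root of the pooled variance on n = k(N-1) d.f."""
    k = len(samples)
    n_obs = len(samples[0])
    ss = 0.0
    for sample in samples:
        mean = sum(sample) / n_obs
        ss += sum((x - mean) ** 2 for x in sample)
    return math.sqrt(ss / (k * (n_obs - 1)))


def select_subset(samples, d):
    """Step d) of (2.10): keep population i iff
    mean_i >= largest mean - d * S / sqrt(N)."""
    n_obs = len(samples[0])
    means = [sum(sample) / n_obs for sample in samples]
    cutoff = max(means) - d * pooled_std(samples) / math.sqrt(n_obs)
    return [i for i, m in enumerate(means) if m >= cutoff]


if __name__ == "__main__":
    # Illustrative data: k = 5 populations, N = 4 observations each (n = 15).
    centers = (0.0, 1.0, 2.9, 3.0, 1.5)
    data = [[m - 1.0, m + 1.0, m - 1.0, m + 1.0] for m in centers]
    # d = 3.34 is the Table III entry for k = 5, n = 15, P* = 0.95.
    print(select_subset(data, d=3.34))
```

Here $S^2 = 4/3$, so the yardstick $dS/\sqrt{N}$ is about 1.93; populations whose sample means fall more than that amount below the largest sample mean (3.0) are screened out, leaving the third, fourth, and fifth populations.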

Remark 12: The width of the "yardstick" in d) of (2.10) is $dS/\sqrt{N}$ which
decreases with N; thus the larger the value of N, the smaller the expected number
of populations that will be included in the selected subset. Also, for fixed N
the more favorable the configuration of the population means (e.g., the larger
the value of $(\mu_{[k]} - \mu_{[k-1]})/\sigma$), the smaller the expected number
of populations that will be included in the selected subset. (This expected
number always lies between unity and kP*.)

Remark 13: In practice the subset approach is often used for screening purposes,
since it tends to eliminate "non-contending" populations (i.e., those with small
$\mu$-values) from the selected subset. The populations retained in the subset can
then be subjected to further study in an independent follow-up experiment in
which the indifference-zone approach (say) is used.

2.7 Factorial experiments involving means

The statistical model given in Section 2.1 is appropriate for
single-factor experiments. In a two-factor experiment we have rc normal
populations $\Pi_{ij}$ $(1 \le i \le r,\ 1 \le j \le c)$ with population means
$\mu_{ij}$ and population variances $\sigma_{ij}^2$. It is sometimes appropriate
to assume that $\mu_{ij} = \mu + \alpha_i + \beta_j$
$(\sum_{i=1}^{r} \alpha_i = \sum_{j=1}^{c} \beta_j = 0)$, i.e., that there is no
interaction between the factors, and that $\sigma_{ij}^2 = \sigma^2$
$(1 \le i \le r,\ 1 \le j \le c)$. Here the $\alpha_i$ and the $\beta_j$ are
referred to as the "effects" of the first and second factor, respectively. It is
assumed that $\mu$, the $\alpha_i$, the $\beta_j$, and $\sigma^2$ are unknown. Let
$\alpha_{[1]} \le \alpha_{[2]} \le \cdots \le \alpha_{[r]}$ and
$\beta_{[1]} \le \beta_{[2]} \le \cdots \le \beta_{[c]}$ denote the ranked values
of the $\alpha_i$ and the $\beta_j$; it is assumed that the pairing of the
$\Pi_{ij}$ with the $\alpha_{[i]}$ and $\beta_{[j]}$
$(1 \le i \le r,\ 1 \le j \le c)$ is completely unknown.

In the above setup it is possible to consider goals such as

Goal: "To select the 'level' of the first factor associated
with $\alpha_{[r]}$ and simultaneously to select the 'level'       (2.11)
of the second factor associated with $\beta_{[c]}$,"

with associated probability requirements. Such problems are treated for the
indifference-zone approach in Section 4 of Bechhofer [1954]. The virtue of
conducting factorial experiments in this situation is discussed by Bawa [1972].
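
To make the additive model and goal (2.11) concrete, the following Python sketch estimates the effects from an r x c table of cell sample means and picks the levels with the largest estimated effects. This is an illustrative assumption about how the selection might be carried out; it is not claimed to be the exact procedure of Bechhofer [1954], and it says nothing about the sample size needed to meet a probability requirement. The function names are hypothetical.

```python
def additive_effects(cell_means):
    """Estimate mu, the alpha_i and the beta_j under
    mu_ij = mu + alpha_i + beta_j with both sets of effects summing to zero.

    cell_means is an r x c list of lists of cell sample means.
    """
    r = len(cell_means)
    c = len(cell_means[0])
    grand = sum(sum(row) for row in cell_means) / (r * c)
    alpha = [sum(row) / c - grand for row in cell_means]
    beta = [sum(cell_means[i][j] for i in range(r)) / r - grand
            for j in range(c)]
    return grand, alpha, beta


def select_levels(cell_means):
    """Pick the row level and column level with the largest estimated effect."""
    _, alpha, beta = additive_effects(cell_means)
    best_row = max(range(len(alpha)), key=lambda i: alpha[i])
    best_col = max(range(len(beta)), key=lambda j: beta[j])
    return best_row, best_col


if __name__ == "__main__":
    table = [[1.0, 2.0, 3.0],
             [4.0, 5.0, 6.0]]
    print(additive_effects(table))
    print(select_levels(table))
```

For the illustrative 2 x 3 table the estimated grand mean is 3.5, the row effects are -1.5 and 1.5, and the column effects are -1, 0, and 1, so the second row level and third column level would be selected.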

The indifference-zone selection procedures of Sections 2.4 and 2.5 can be used
in multi-factor experiments; it is only necessary to make appropriate
modifications in the procedures.

It is also possible to conduct single-factor or multi-factor ranking
and selection experiments using the standard experimental designs such as
randomized blocks and Latin squares, and these designs play the same type of role
here as they do in classical hypothesis-testing situations.

2.8 Means vs. a fixed known standard

In Sections 2.4.1-2.4.4 and 2.5.1-2.5.2 the selection procedures
proposed were devised to select the category associated with the largest
$\mu$-value. However, in certain classes of experiments even the "best" one of
the competing categories, i.e., the category with the largest $\mu$-value, may not
be good enough to warrant the experimenter's selecting it. For example, if the
competing categories are drugs, the best one may not be worthy of consideration
unless the expected period of immunity obtained with that drug is at least some
specified period of time; or if the competing categories are types of heat
treatment of steel, the best one may not be deemed satisfactory unless the
expected tensile strength resulting from that type of treatment is at least
some specified minimum value. Such types of problems involving comparisons
of means with a fixed known standard are considered by Bechhofer and Turnbull
[1974], [1975a]; in the first paper a single-stage procedure is proposed under
assumption (2.5a), and in the second a two-stage procedure is proposed under
assumption (2.5c). These procedures are generalizations of Bechhofer [1954]
and Bechhofer, Dunnett, and Sobel [1954]. Gupta and Sobel [1958] proposed a
single-stage procedure for this problem using the subset approach.

3.  The  normal  variances  problem 

Section 2 dealt with the normal means problem. Corresponding procedures
exist for the normal variances problem. Ranking and selection problems involving
variances arise, for example, when the ordnance engineer is interested in
selecting that type of projectile which yields the smallest dispersion of
range, or when the laboratory technician is interested in selecting that
measuring instrument which has the highest precision (e.g., that scale which
has the greatest reproducibility). An analogue of the single-stage procedure
given in Bechhofer [1954] for normal means is given in Bechhofer and Sobel
[1954] for normal variances; factorial experiments involving variances are
treated in Bechhofer [1968a] and [1968b] using a model proposed in Bechhofer [1960].
Bechhofer and Turnbull [1975b] is the counterpart for variances of Bechhofer and
Turnbull [1974]. An analogue of the procedure given in Gupta [1956] for normal
means is given in Gupta and Sobel [1962] for normal variances.


4.  The  Bernoulli  p problem,  and  other  problems 

Ranking and selection problems involving Bernoulli p's (i.e., probabilities
of "success" on a single trial) arise, for example, when a consumer is interested
in selecting that producer whose product has the smallest fraction defective.
An analogue of the procedure given in Bechhofer [1954] for normal means is
given in Huyett and Sobel [1957] for Bernoulli p's, while the counterpart of the
procedure given in Gupta [1956] for normal means is given in Gupta and Sobel
[1960] for Bernoulli p's.

Sobel [1954] proposed a sequential procedure for selecting the exponential
population with the largest mean; his results have applicability in reliability
studies. Bechhofer, Kiefer, and Sobel [1968], p. 63 considered sequential
procedures for ranking parameters of certain stochastic processes such as the
Poisson process and the Wiener process. Various research workers have proposed
procedures for many other ranking and selection problems involving parameters
of distributions arising in practice.

5. Closing remarks

The ranking and selection formulation of statistical problems involving
inferences concerning $k \ge 2$ categories has wide applicability in the solution
of problems arising in experimentation. In this paper we have sketched only
a small number of the relevant ideas and procedures. The interested reader
is referred to Bechhofer, Kiefer, and Sobel [1968] for references up to that
date, and to Gupta and Panchapakesan [1972] for references to the latter date
concerning the subset approach. Additional and more recent references are given
by Wetherill and Ofosu [1974]. The writer would appreciate learning of
experimental situations in which some of the procedures described herein have
proved helpful.

ABSTRACT. The constraints of limited time, limited resources, and
large inherent sample variance characterize army field experimentation.
For these reasons the significance level, the power, or both, of tests
of hypotheses are usually not as high as one would desire. The decision
must be made as to whether low power or low significance is less undesirable;
or, perhaps, we can eat our cake and have it, too.

This  paper  briefly  describes  the  US  Army  Combat  Developments 
Experimentation  Command  and  its  mission  to  perform  field  experiments. 

It  then  discusses  a typical  example  which  brings  out  the  major  problems 
in  this  type  experimentation.  Methods  of  dealing  with  a large  number  of 
sources  of  variation  are  discussed. 

1. INTRODUCTION. The process known as combat developments
includes the planning of the future army; how it will be equipped, how it
will be organized, how it will fight. The specific products which are
produced to address these topics are materiel need documents, tables of
organization and equipment, and field manuals. Often in the development
of these products, a concept or a piece of equipment must be tested with
troops. It is the mission of the Combat Developments Experimentation
Command (CDEC) to design and conduct field experiments in these areas.

The players in these experiments are trained army troops, the equipment
is operational or prototype hardware, the environment is a realistic combat
environment. CDEC deals in small unit experimentation, usually smaller
than company size elements on each side.

The sources of variation in field experimentation are more difficult
to handle than in many types of experimental situations. This is caused by
the necessary freedom given the players to behave as if they were in a
combat environment. Dealing with this variation is the major subject of
the paper.
2. THE CDEC MISSION. It is quite appropriate that representatives
of CDEC should participate in the Army Design of Experiments Conference.
We are one of only two organizations in the army with the word experiment
(in some form) in their title. CDEC is unique in the Army, DOD, and
probably in the world. Its sole mission is to perform field experiments
in which the equipment is put in the hands of the troops to be used in an
operationally relevant environment. This is as distinguished from field tests,
engineering tests, and operational tests in which the major concern is
the operation of the equipment rather than the interaction of the equipment
with the personnel and the environment.

CDEC is a service organization. With rare exceptions the requirement
for experimental data comes from an organization outside of CDEC. This
organization tasks CDEC through its higher headquarters, the Training and
Doctrine Command (TRADOC), and acts as proponent for the experiment.

The major customers for the CDEC product are the TRADOC schools and
centers: the Combined Arms Center, Armor, Infantry, Artillery, Aviation,
and Air Defense schools. CDEC provides data to satisfy one of three
requirements. The first is to compare two or more alternatives. These
alternatives may be hardware systems, strategies, or organizations. The
second is to compare one or more alternatives with a standard system.
The third is to provide data for use in computer models, simulations, or
war games. The data may be used for model validation or for data input.

CDEC  is  a subordinate  command  of  TRADOC.  The  staff  section  in 
TRADOC  responsible  for  CDEC  is  the  Deputy  Chief  of  Staff  for  Combat 
Developments.  The  Operational  Test  and  Evaluation  Agency  (OTEA) 
monitors  the  work  of  CDEC  as  well  as  all  operational  tests  throughout  the 
army.  The  Commander  of  OTEA  chairs  the  Test  Schedule  and  Review 
Committee (TSARC) which meets twice yearly to review the army test
program. The TSARC, a general officer committee, manages the Five Year
Test  Program.  This  program  consists  of  all  army  operational  tests,  joint 
tests,  and  force  development  tests  and  experiments. 

Within  CDEC,  the  experimentation  mission  is  performed  by  three 
major divisions. The first is responsible for developing the overall CDEC
program, working with the proponents in identifying the problems, and
designing the experiment. The second division performs the experiment,
analyzes the data and writes the report. The third division maintains and
operates  the  highly  complex  system  of  instrumentation  necessary  to  collect 
the  data. 
The instrumentation system at CDEC consists of five major categories
of hardware.

a. Instrumentation Control. This is a system of computers which
includes a GE M605, an XDS 930, and two XDS 910's. CDEC is currently in
the process of changing over to a new computer control system.

b. Position Location System. This system consists of a series of
movable transmission towers which can be located anywhere on the range,
a series of small transponders which automatically send and receive
messages, and two relay stations which interface with the computer. The
computer, by sending messages to and receiving messages from the
transponders via the transmission towers, can compute the location of the
transponders on a second by second basis. Each player element carries a
uniquely coded transponder to identify his location to the computer. A
continuous location record is kept of all players by the computer.

c. Simulated Fire. The simulated fire system consists of 60 small
laser generators. These eye safe lasers can be bore sighted with a
weapon and wired so that when the trigger is pulled a coded laser beam
is emitted rather than a projectile. When a laser beam is emitted, a
message to that effect is sent to the computer through the transponder,
automatically. If the laser beam strikes a target, a laser sensor on
the target is activated. This message is sent to the computer through
the transponder on the target. This "hit" does not necessarily mean
the target was "damaged" or "killed" by the exchange. The computer
now possesses the following knowledge from the engagement: who fired,
the weapon type, the location of the firer, who was the target, and the
location of the target. The computer then determines the range of
engagement and the aspect and speed of the target. Then, using
probability of kill functions resident in memory, the actual probability
of kill is computed. A random number is drawn by the computer to determine
whether damage or a kill has been inflicted by the engagement. If the
target is assessed as a casualty it is sent a message to that effect and
the laser weapon of that player is deactivated by the computer so he
can't shoot any more. The target also detonates a smoke grenade and
displays a distinctive panel to inform the other players in the game of his
status.

d. Live Fire. There are two computer controlled live fire ranges,
one offensive and one defensive. The man-sized targets can be programmed
to come up in an exposed position on any preassigned schedule, to fire
blanks, to duck for a programmed time if a bullet comes close, and to
fall down and stay down if hit.

e. Support Instrumentation. The systems represented in the support
instrumentation are additional data gathering and storage systems and
control systems. There are two radar systems for target tracking,
there are cameras to record, manually or automatically, what is taking
place, there are several multiple channel voice recording systems to
record by radio or telephone all that goes on in a trial, and there is a
master timing system. This timing system simultaneously time tags all
records kept of the experiment for the purpose of cross checking redundant
data records.
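
The casualty assessment described for the simulated fire system (item c) can be sketched in a few lines. This is only an illustration under stated assumptions, not CDEC's software: the function names, the flat x-y coordinates, and the linear probability of kill curve are hypothetical, and the weapon type, target aspect, and target speed that the real functions use are left out.

```python
import math
import random


def example_p_kill(range_m):
    """Hypothetical probability of kill curve: falls linearly with range."""
    return max(0.0, 0.9 - range_m / 2000.0)


def assess_engagement(firer_xy, target_xy, p_kill_fn, rng):
    """Assess one laser hit: range, probability of kill, casualty or not."""
    range_m = math.dist(firer_xy, target_xy)  # range from the two locations
    p_kill = p_kill_fn(range_m)               # evaluate the stored function
    casualty = rng.random() < p_kill          # random draw decides the outcome
    return range_m, p_kill, casualty


rng = random.Random(1)  # seeded only so the sketch is repeatable
range_m, p_kill, casualty = assess_engagement(
    (0.0, 0.0), (300.0, 400.0), example_p_kill, rng
)
print(range_m, p_kill, casualty)
```

The point of the sketch is that a sensed "hit" is only an input; whether the target becomes a casualty is decided by the random draw against the computed probability.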

The  geographic  area  used  in  CDEC  experiments  is  Hunter  Liggett 
Military  Reservation  at  Jolon,  California.  This  reservation  has  166,000 
acres  consisting  of  all  terrain  types  from  flat  treeless  plains  areas  to 
heavily  forested  rugged  mountains.  The  weather  is  mild  and  rainy  in  the 
winter,  hot  and  dry  in  the  summer.  CDEC  controls  the  air  space  over 
Hunter  Liggett  to  an  altitude  of  10,000  feet. 

3.  LEVELS  OF  DATA  REFINEMENT.  CDEC  recognizes  six  levels  of 
data  refinement.  The  hierarchy  represented  in  these  levels  is  valuable 
in  discussing  the  form  and  nature  of  experimental  results  desired  by  the 
proponent.  The  six  levels  are  defined  here: 

a.  Level  1 - Raw  Data.  Data  at  Level  1 are  data  in  their  original 
form.  This  includes  data  on: 

(1) Data collection forms used by a controller or data collector.

(2)  Magnetic  tape.  This  refers  to  the  original  tape  used 
during  the  conduct  of  the  experiment. 

(3)  Camera  film,  unedited. 

(4)  Voice  Recording  System  tape  (VRS),  unedited. 

(5)  Punch  cards  or  hard  copy  print-outs  of  the  contents  of  (2). 

At this level no data purification has taken place except the elimination
of data which are obviously invalid, such as that caused by an
instrumentation malfunction.

b. Level 2 - Reduced Data. Data at Level 2 have been taken from the raw
data form and consolidated for evaluation of data quality. This first
level of data refinement is performed soon after the data are collected,
usually within one day.

c.  Level  3 - Ordered  Data.  Data  at  Level  3 have  been  checked  for 
accuracy  and  placed  in  logical  order.  Data  at  this  level  may  be  produced 
in  one  or  more  of  the  following  forms: 

(1) Ordered computer print-out.

(2) Typed listing.

(3) Purified and ordered tape.

(4)  Edited  camera  film. 

(5)  Edited  VRS  tape. 

(6)  Punch  cards. 

Data  in  this  form  will  have  been  thoroughly  purified.  Invalid  data  will 
have  been  identified  and  eliminated.  The  data  may  be  ordered  on  any  of 
several  dimensions  such  as: 

(1) Measurement taken, e.g., time to detect, total exposure time,
range to target, or crossing velocity.

(2)  Trials. 

(3)  Player  elements. 

(4)  Time  of  day. 

No arithmetic operations, with the possible exception of counting, are
applied to data at this level. Data at Level 3 are distinguished from data
at Level 2 by the terms "edited" and "ordered."

d. Level 4 - Descriptive Data. Data at Level 4 have been subjected
to any of several elementary statistical and mathematical operations.

Data  in  this  form  will  usually  consist  of: 

(1)  Frequency  distributions.  Such  distributions  may  be  in 
tabular  form,  histograms,  or  curves  smoothed  by  eye. 

(2)  Computed  means,  variances  and  standard  deviations  of 
distributions. 

(3)  Computed  medians,  modes,  ranges,  quartiles,  deciles,  etc. 

(4)  Computed  percentages. 

(5)  Computed  correlation  coefficients. 

Processing  of  data  to  Level  4 does  not  include  drawing  inferences. 
Significance of the difference between any of the measurements is not
given. Data at this level differ from those at Level 3 in that they are
summarized  and  combined  into  more  concise  measures.  Data  at  this 
level  should  not  go  beyond  what  may  be  called  "data  descriptive  of 
what  happened  in  the  experiment." 

e.  Level  5 - Inferred  Data.  Data  at  Level  5 have  undergone  statistical 
tests  of  hypothesis  and/or  interval  estimation.  The  design  of  the  experi- 
ment is  constructed  so  that  the  specific  planned  tests  and  estimates  can 
be  made.  Although  there  are  many  tests  of  hypothesis  in  the  literature, 
those  techniques  which  will  most  often  be  used  at  this  level  of  data 
refinement  are: 

(1)  "Student's"  t test. 

(2) Chi square test.

(3)  Snedecor's  F test. 

(4)  Analysis  of  variance. 

(5) Regression analysis.

(6) Any of several standard non-parametric tests.

Hypotheses to be tested will include testing whether:

(1)  An  observed  sample  represents  a sample  from  a standard 
or  known  distribution. 

(2)  Two  (or  more)  observed  samples  are  both  (all)  samples  from 
the  same,  perhaps  unknown  distribution. 

(3)  A sample  parametric  estimate  such  as  a mean,  median, 
standard  deviation,  or  regression  coefficient  differs  from  a given  fixed 
value. 

(4)  Two  or  more  independent  sample  parametric  estimates  differ 
from  each  other. 

Data  at  this  level  do  not  include  statistical  inferences  on  ex  post  facto 
questions  generated  either  from  an  outside  source  or  by  the  results 
themselves.  Level  5 data  are  limited  to  preplanned  statistical  analysis 
of  the  data  generated  in  the  experiment. 

f. Level 6 - Analyzed Data. Data at Level 6 have received a more
thorough and detailed analysis than at Level 5. Analysis at this level is
characterized by two features:

(1) It answers questions or investigates areas not planned for
in the original experiment, or

(2)  It  combines  the  results  of  the  experiment  with  data  obtained 
elsewhere  in  order  to  generalize  the  conclusions  which  may  be  drawn. 

A classic  example  of  Level  6 analysis  is  the  insertion  of  experimentally 
derived  data  into  a combat  model  to  generate  new  information  to  help 
answer  force  mix  questions.  A second  example  is  the  use  of  experimen- 
tally derived  intervisibility  data  to  determine  the  probability  that  a 
target  is  available  when  a tube  launched  guided  projectile  arrives. 

Another way to distinguish analyses at Level 5 and Level 6 is that data
at Level 5 are pure data, are derived by deductive reasoning, and concern
themselves solely with the quantitative nature of the population from
which the experimental sample was drawn. Data at Level 6, on the other
hand, answer operational questions, require inductive reasoning, and use
the experimental results to assist in shedding light on the key military
issues.

Although CDEC reports to the proponent on data at Level 3, the
subject of the remainder of this paper concerns analyses at Levels 4 and
5. Analysis at Level 6 is generally done by an analytic organization
rather than an experimentation organization.
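
The difference between Level 4 and Level 5 can be made concrete with a short sketch. The detection times below are invented for illustration and are not CDEC data; the function names are likewise hypothetical. The first function stops at description (Level 4), while the second computes a preplanned pooled-variance "Student's" t statistic (Level 5).

```python
import math
import statistics

# Hypothetical time-to-detect values in seconds (not CDEC data).
device_a = [12.0, 15.0, 11.0, 14.0, 13.0, 16.0]
device_b = [18.0, 17.0, 20.0, 16.0, 19.0, 21.0]


def describe(sample):
    """Level 4: summarize what happened, with no inference drawn."""
    return {
        "mean": statistics.mean(sample),
        "variance": statistics.variance(sample),  # sample variance, n - 1
        "median": statistics.median(sample),
        "range": max(sample) - min(sample),
    }


def pooled_t(x, y):
    """Level 5: two-sample t statistic with pooled variance, and its d.f."""
    nx, ny = len(x), len(y)
    pooled_var = (
        (nx - 1) * statistics.variance(x) + (ny - 1) * statistics.variance(y)
    ) / (nx + ny - 2)
    t = (statistics.mean(x) - statistics.mean(y)) / math.sqrt(
        pooled_var * (1 / nx + 1 / ny)
    )
    return t, nx + ny - 2


summary_a = describe(device_a)
summary_b = describe(device_b)
t_stat, dof = pooled_t(device_a, device_b)
print(summary_a, summary_b, t_stat, dof)
```

Comparing `t_stat` with a tabled critical value for `dof` degrees of freedom is the step that turns descriptive data into inferred data.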

4. A TYPICAL EXPERIMENT. This example describes some design problems
which are characteristic of the type of experiments conducted at CDEC.
A general solution to most of these problems will be indicated. The
following paper by Dr. Mallios will discuss some specific techniques that
have been used to solve some of the more troublesome problems.

This example is hypothetical (barely). Barely, only because it has been
simplified to its basic important elements. It is very typical. Let us
say there are two helicopter mounted target acquisition devices the
proponent wants to compare. He also wants to compare the devices at
two ranges, i.e., he suspects a device X range interaction. (Generally there
are many other independent variables he is interested in, such as the size of
the target, is the target moving or stationary, is the target hot or
cold (IR emissions), is the helicopter hovering or moving, etc.) Let us say
further in our 2x2 experiment that we have time, money, fuel, etc.,
for exactly 48 trials. The dependent variable is "time to detect."

One  would  likely  propose  such  a design  and  model  as  is  shown  in  Table  1. 

TABLE 1 - BASIC DESIGN

                      DEVICE
                     A       B
     RANGE    1.
              2.

MODEL: y = m + d + r + dr + e                    (1)

ANALYSIS OF VARIANCE

     Source of variation     d.f.
     Devices (d)               1
     Range (r)                 1
     Interaction (dr)          1
     Error (e)                44

Fine, it will work. But we must have players. How many players?
Certainly at least 1 and at most 48. Let's look at those two extremes.

A proposed model for the case of one player would be,

y = m + d + r + dr + t + e                       (2)

where  t is  the  learning  effect.  Certainly  he  wouldn't  have  the  same 
expected  behavior  on  the  first  trial  as  on  the  48th  trial.  This  learning  effect 
is  neither  linear  nor  random.  Dealing  with  it  presents  a significant  problem. 
Moreover, one must keep in mind the population about which we are making
inferences.  In  terms  of  people,  that  population  is  the  group  of  people  who 
may  use  the  device  to  detect  targets  in  combat.  Certainly,  making  infer- 
ences about  that  population  with  a sample  of  one  player  is  poor  procedure. 

The 48 player proposition does not suffer from either of these
shortcomings. It is shown in many experiments that the greatest single cause
of variation in experimental results is the difference between players.
The postulated model for this case is,

y = m + d + r + dr + p + e                       (3)

The player effect (p) is confounded with the error. It occurs as a term
in the expected mean square of all four sources of variation shown in the
ANOVA of Table 1. Since the player variance is so large, it overpowers
the F ratio and nothing but the most obvious treatment effects show up as
significant. (This is apart from the logistical problem of finding and
training that many qualified players.)

Now that we have disposed of those two options let us look at another
and a more reasonable player option. Let us use four players, each one
playing in three trials in each of the cells of the design in Table 1. Since
each player plays in 12 trials, each of the cells can contain one of each
of the player orders: 1, 2, ... 12. A model for this design is,

y = m + d + r + dr + p + t + e                   (4)

where all interactions except the device X range interaction are assumed
away. This design is better. We can isolate the player and learning effects,
but there is still the question of sample size. Is a sample of 4 from all
possible soldiers an adequate representation?

A final and best alternative we will consider is one which uses 12
players, each one playing once in each of the four cells. Again we can
balance the order effect within the cells. The postulated model is the
same as in equation (4); however, the analysis of variance would have
redistributed degrees of freedom and different coefficients in the expected
mean squares. The analysis of variance is shown in Table 2.

TABLE 2

ANALYSIS OF VARIANCE

     Source of variation     d.f.
     Devices                   1
     Range                     1
     Interaction               1
     Players                  11
     Order                     3
     Error                    30

Since  "Players"  would  normally  be  a random  factor,  more  correctly  the 
error  term  for  the  main  effects  of  range  and  device  would  be  their  inter- 
action with  player.  We  have  assumed  this  to  be  zero  which  makes 
"error"  the  denominator  of  all  F ratios.  If  one  is  unwilling  to  assume 
zero  interactions,  the  player  interactions  can  be  computed  and  used  as 
error  terms.  In  this  case,  the  player  X device  X range  interaction  is 
confounded  with  order  so  no  legitimate  test  exists  for  the  range  X device 
interaction. 
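
One way to lay out the 12 player design and check its degrees of freedom is sketched below. The cyclic assignment of play order is an assumption chosen for illustration (the paper does not say how order was balanced), and all names are hypothetical. Each player meets each of the four cells once, and each cell receives each order position three times.

```python
from collections import Counter
from itertools import product

DEVICES = ("A", "B")
RANGES = (1, 2)
CELLS = list(product(DEVICES, RANGES))  # the four device-by-range cells
N_PLAYERS = 12


def build_design(n_players=N_PLAYERS):
    """Each player plays once per cell; order is rotated cyclically."""
    design = []
    for player in range(n_players):
        for c, (device, range_level) in enumerate(CELLS):
            order = (player + c) % len(CELLS) + 1  # order position 1..4
            design.append(
                {"player": player + 1, "device": device,
                 "range": range_level, "order": order}
            )
    return design


def degrees_of_freedom(n_trials, n_players, n_orders):
    """Partition the total d.f. as in the ANOVA of Table 2."""
    df = {"devices": 1, "range": 1, "interaction": 1,
          "players": n_players - 1, "order": n_orders - 1}
    df["error"] = (n_trials - 1) - sum(df.values())
    return df


design = build_design()
order_counts = Counter(
    ((t["device"], t["range"]), t["order"]) for t in design
)
df = degrees_of_freedom(len(design), N_PLAYERS, len(CELLS))
print(df)
```

With 48 trials the partition reproduces the 30 error degrees of freedom of Table 2.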

All  of  the  foregoing  is  merely  the  peak  of  the  iceberg.  The  real 
problems  in  field  experimentation  center  around  dealing  with  such  factors 
as: 


a.  Carry  over  effect.  This  effect  is  that  influence  the  treatment 
combination  experienced  by  the  player  in  trial  i has  upon  his  performance 
in  trial  i + 1. 

b.  Environmental  factors. 

(1)  Dust. 

(2)  Wind. 

(3) Atmospheric attenuation.

(4) Light level.

(5)  Sun  angle. 

c.  Target  location  factors. 

(1)  Target  to  background  contrast. 

(2)  Background  clutter. 

(3)  Shape  contrast. 

It is clearly impossible to control all of these sources of variation.
Each one may have an effect on the measurement of interest in a given trial.
One solution is to balance on these factors, i.e., assure in the design
that whatever the level of these factors is, that level occurs with equal
frequency in each of the four cells in Table 1. This can be achieved
physically by having a single target and four observers, two at each range
with one of the devices at each range, perform the detection task
simultaneously. Then, as nearly as possible, we can say that the differences
in detection time are due to the difference in the main effects and their
interaction (plus random error). We perform a sequence of 12 such games
with the proper player and order design. Now the analysis shown in
Table 2 is valid, or is it? When the main effect mean squares are computed,
to be sure, the variance caused by the above mentioned sources is absent.
But when the error mean square is computed it shows up in all of its glory.
What happens, then, is that the expected mean square of the denominator
of the F ratio contains terms not found in the numerator. This leads to
small, inappropriate F ratios.

This brings us to the last straw. To avoid the problem of inappropriate
F ratios and still have degrees of freedom left for error, we can
randomize the assignment of treatment combinations to target locations,
days, and times of day. The terms mentioned above then occur in all
expected mean squares and the result is valid F ratios.

5. COUP DE GRACE. In real life, do we really randomize? This only
leads to tests of very low power. Which is more important, to get the
very best estimates possible of the behavior of a system, or to get
estimates of the behavior of a system which can be subjected to valid
statistical tests? We think the former. It is for this reason that we favor
balancing rather than randomizing. We still do tests of hypotheses but
recognize clearly that these tests are often very conservative. In other words,
we are more interested in obtaining the most accurate data possible at level
4 of refinement at the expense of doing rigorous level 5 analyses.

SAMPLE SIZE TRADE-OFFS
AND
THE CONSTRAINED MAXIMIZATION OF INFORMATION

William S. Mallios
The BDM Corporation
Monterey, California

ABSTRACT. Cost effectiveness is applied to sample size determination
for field trial experimentation. Compound distributions are used in
establishing trade-offs between sample size combinations and expected
information. These trade-offs are used in maximizing information under cost
constraints.

1. INTRODUCTION. This writing illustrates the use of sample size
trade-offs in the design of field trials. Trade-offs are possible when
sample size has more than one dimension. For example, a two dimensional
sample size occurs when responses are to be drawn from each of a number
(n) of experimental units in each of a number (N) of trials per fixed
environment*: units are nested in trials, and trials are nested in
environments. Since a number of (n,N) selections can give rise to approximately
the same level of information, an appropriate selection is one which costs
least, assuming other constraints have not been violated.

Sample  size  trade-offs  provide  a basis  for  answering  the  following 
questions  regarding  the  design  of  field  trials. 

How was prior knowledge used in the design of the proposed
experiment?

How much information is to be gained for a given expenditure?        (1.1)

What is the loss (gain) in information as the expenditure is
decreased (increased)?

Experimental objectives and information level (I.L.) are related as
follows. Model specification is dependent on the objectives, while I.L.
is inversely proportional to the variability associated with the model.
Key to this relation is the model which reflects the objectives and
dictates the design and method of analysis. In turn, variability can be
quantified in terms of a variance, a generalized variance (the
determinant of a covariance matrix associated with the model), or other
appropriate measures.

Our use of I.L. is directed at quantifying information on the state
of knowledge, not the state of uncertainty as in the Shannon formulation
of information; see Pierce (1960). Thus, I.L. is more attuned to the
definition of intrinsic accuracy (see Fisher (1950)) in describing the
amount of information yielded by each member of a sample regarding a
distributional parameter. However, with the diversity of definitions of
"information", I.L. is left in general terms so as to allow for
flexibility and further experimentation in applying the quantity. For example,
a generalized variance (as the reciprocal of I.L.) is proportional to the
square of the volume of the concentration ellipsoid (see Cramer (1946)),
the geometrical representation of a distribution about its center of
gravity. Or, if the objectives are formulated in terms of a linear
combination of variables, the covariance matrix is reduced to a single
variance as the inverse of the I.L. required, say, to attain a confidence
interval of a certain width or to test for a certain difference between
expectations. Regarding our loose definition of I.L., it should be
recalled that Fisher (1950) tempered his definition as follows: "I am more
inclined to examine the quantity (information) as it emerges from
mathematical investigations and to judge of its utility by the free use of
common sense, rather than to impose it by formal definition".

* When environments are random, sample size becomes three dimensional.

Sample sizes should be based on the model, but oftentimes this is
no easy matter. Consider, for example, a proposed field experiment where
a yes or no response is to be measured from each of n units in each of
N trials per environment. In addition, trials are to be quantified, if
possible, on an ad hoc basis through measures of prevailing meteorological
(met) conditions, and it is anticipated that the probability of
response will vary between trials within environments. Our recourse to
this problem involves, firstly, utilizing distributions which account
for varying probability of response, secondly, getting prior estimates
of the distribution's parameters, and thirdly, establishing trade-offs
between sample sizes. Regarding the varying probabilities, compound
distributions are applied. As to prior estimates, these can be obtained
through analyses of data from exploratory trials, from past experiments
of a similar nature, from computer simulation studies, from combinations
thereof, or, as a last resort, through a good guess.

Section 2 contains background material on data collected in previous
field trials in one particular environment. In Section 3, these data are
used in establishing sample size trade-offs in maximizing I.L. under cost
constraints. The aspects of model derivation, parameter estimation, and
goodness of fit are considered in the appendices.

2. RESULTS OF DATA ANALYSIS FOR AN AIR POLLUTION EXPERIMENT. Consider
an environment with an inversion occurring at a few hundred meters
in height, the inversion defined as an atmospheric layer, of limited
vertical extent, in which temperature increases with height. Beneath the
inversion, windflow is light and variable, with pollutant plumes emanating
continuously from a fixed point source (say, a smokestack). The
pollutant does not penetrate the inversion so that the plumes are dispersed by
the winds and turbulence. These conditions can lead to large
concentrations of ground level pollution, all of which mandate useful predictions
of concentrations from proposed or existing sources in such environments.

Prior to proposed field experimentation, data were analyzed from an
experiment in a similar environment. Therein, air samplers were located
near ground level within an area of size 1200 meters by 1000 meters;
concentrations at varying heights above ground level were not considered.
Dissemination of a pollutant simulant was from a fixed point source
located at the center of the grid with responses drawn from n = 35 samplers,
arranged in a uniform pattern, in each of N = 22 trials. The response from
each sampler was concentration or the number of particles cumulated over
a fixed span following dissemination.


Better known models, based largely on the normal distribution, are
aimed at predicting concentration at given grid coordinates; e.g., under
the Gaussian plume model, the effluent is assumed to expand normally in
the horizontal and vertical directions as it moves downwind with a
prevailing wind; see Panofsky (1969). However, these models are known to
be inadequate when eddy sizes are sufficiently large to move the effluent
along a meandering path, as was the case in these previous trials.
Instead of predictions at given grid coordinates (see Mallios (1969)),
alternative consideration was given to models which predict percentages of
the grid area subjected to given ranges of concentration and which
account for between trial variation in these percentages. Accordingly,
based on compliance with experimental objectives, sampler responses were
categorized, on a per trial basis, to one of the four particle number
ranges

$$C_1: (0, 99), \quad C_2: (100, 999), \quad C_3: (1000, 9999), \quad C_4: (\geq 10{,}000); \qquad (2.1)$$

e.g., samplers with particle counts between 100 and 999 were assigned to
category $C_2$. With $x_{ij}$ denoting the number of sampler responses assigned
to category $C_i$ in the j-th trial, we have $\sum_{i=1}^{4} x_{ij} = 35$. The data are
presented in Table 1.


Subjecting these data to a contingency table analysis leads to the
obvious result that the multinomial distribution does not account for
the between trial variation of $x_j = (x_{1j}, \ldots, x_{4j})'$; i.e., assuming $x_j$
follows the multinomial distribution with probability
$p_j = (p_{1j}, \ldots, p_{4j})'$ and setting
$\hat{p}_i = \sum_{j=1}^{22} x_{ij} \big/ \sum_{i=1}^{4} \sum_{j=1}^{22} x_{ij}$,
the hypothesis H: $p_1 = p_2 = \cdots = p_{22}$ is rejected in view of the value of

$$\sum_{i,j} (x_{ij} - n\hat{p}_i)^2 / (n\hat{p}_i) = 175.4,$$

which, under H, follows the $\chi^2$ distribution with 63 degrees of freedom.
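
The homogeneity statistic can be recomputed directly from the per trial counts of Table 1. The sketch below is an illustration rather than the author's program; the function name is hypothetical, and it assumes every trial has the same number of samplers (n = 35).

```python
# Counts of samplers in categories C1..C4 for trials 1..22 (Table 1).
COUNTS = [
    (23, 4, 4, 4), (17, 8, 5, 5), (26, 5, 2, 2), (13, 7, 11, 4),
    (23, 8, 3, 1), (22, 4, 2, 7), (12, 5, 12, 6), (21, 6, 4, 4),
    (5, 12, 10, 8), (13, 15, 2, 5), (10, 18, 3, 4), (13, 12, 4, 6),
    (13, 14, 3, 5), (15, 10, 5, 5), (6, 6, 12, 11), (20, 0, 14, 1),
    (12, 6, 12, 5), (13, 6, 8, 8), (17, 9, 3, 6), (4, 18, 8, 5),
    (14, 6, 8, 7), (15, 5, 9, 6),
]


def homogeneity_chi_square(counts):
    """Pearson chi-square for equal category probabilities across trials."""
    n_trials = len(counts)
    n_cats = len(counts[0])
    n = sum(counts[0])  # samplers per trial, assumed equal for all trials
    col_totals = [sum(row[i] for row in counts) for i in range(n_cats)]
    grand_total = sum(col_totals)
    p_hat = [total / grand_total for total in col_totals]  # pooled estimates
    stat = sum(
        (row[i] - n * p_hat[i]) ** 2 / (n * p_hat[i])
        for row in counts
        for i in range(n_cats)
    )
    dof = (n_trials - 1) * (n_cats - 1)
    return stat, dof


chi_square, dof = homogeneity_chi_square(COUNTS)
print(round(chi_square, 1), dof)
```

The statistic comes out close to 175.4 on 63 degrees of freedom, far above the mean of 63 expected under H, which is why the plain multinomial model is set aside.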


Table 1. A per trial grouping of sampler responses.

                                  Categories
          C1: (0,99)     C2: (100,999)   C3: (1000,9999)   C4: (≥10,000)
 Trial     %   (No.)       %   (No.)        %   (No.)        %   (No.)
   1      67   (23)       11   ( 4)        11   ( 4)        11   ( 4)
   2      49   (17)       23   ( 8)        14   ( 5)        14   ( 5)
   3      74   (26)       14   ( 5)         6   ( 2)         6   ( 2)
   4      37   (13)       20   ( 7)        32   (11)        11   ( 4)
   5      66   (23)       23   ( 8)         8   ( 3)         3   ( 1)
   6      63   (22)       11   ( 4)         6   ( 2)        20   ( 7)
   7      34   (12)       14   ( 5)        34   (12)        18   ( 6)
   8      60   (21)       18   ( 6)        11   ( 4)        11   ( 4)
   9      14   ( 5)       34   (12)        29   (10)        23   ( 8)
  10      37   (13)       43   (15)         6   ( 2)        14   ( 5)
  11      29   (10)       52   (18)         8   ( 3)        11   ( 4)
  12      37   (13)       34   (12)        11   ( 4)        18   ( 6)
  13      37   (13)       41   (14)         8   ( 3)        14   ( 5)
  14      43   (15)       29   (10)        14   ( 5)        14   ( 5)
  15      18   ( 6)       18   ( 6)        33   (12)        31   (11)
  16      57   (20)        0   ( 0)        40   (14)         3   ( 1)
  17      34   (12)       18   ( 6)        34   (12)        14   ( 5)
  18      37   (13)       17   ( 6)        23   ( 8)        23   ( 8)
  19      49   (17)       26   ( 9)         8   ( 3)        17   ( 6)
  20      11   ( 4)       52   (18)        23   ( 8)        14   ( 5)
  21      40   (14)       17   ( 6)        23   ( 8)        20   ( 7)
  22      43   (15)       14   ( 5)        26   ( 9)        17   ( 6)

 Totals       (327)           (184)            (144)            (115)
 Average   42.5%           23.9%            18.7%            14.9%

Readings of meteorological (met) instrumentation, operational during
these trials, were evaluated in attempting to classify trials into met
regimes within which the multinomial probabilities were approximately
constant. It was found that the instrumentation could not identify differing
regimes under these conditions, so that the 22 trials were taken as
characteristic of one met regime. An area-coverage distribution applicable to
this regime is obtained by compounding the multinomial and multivariate
beta distributions. The result, given by

$$h(x; n, \alpha) = \frac{n!\; \beta(\alpha_1 + x_1,\ \alpha_2 + x_2,\ \alpha_3 + x_3,\ \alpha_4 + x_4)}{\beta(\alpha_1, \alpha_2, \alpha_3, \alpha_4)\; x_1!\, x_2!\, x_3!\, x_4!} \qquad (2.2)$$

is termed the multinomial-multivariate beta (MMB) distribution, where the
$\alpha_i > 0$ are parameters of the quadrivariate beta distribution; see Appendix 1
for the moments of (2.2), Appendix 2 for estimation of the $\alpha_i$, and
Appendix 3 for goodness of fit of (2.2) to the data in Table 1.
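
A compound distribution of the form (2.2) can be evaluated numerically with log-gamma functions. The sketch below assumes that β denotes the multivariate beta function, β(a) = Γ(a_1)...Γ(a_4) / Γ(a_1 + ... + a_4), which makes (2.2) the distribution usually called Dirichlet-multinomial. The parameter values are hypothetical and are not the estimates of Appendix 2; the function names are also illustrative.

```python
from math import exp, lgamma


def log_multivariate_beta(a):
    """log of prod(Gamma(a_i)) / Gamma(sum(a_i)); assumed meaning of beta."""
    return sum(lgamma(v) for v in a) - lgamma(sum(a))


def mmb_pmf(x, alpha):
    """Probability of the category counts x under the compound model."""
    n = sum(x)
    log_h = (
        lgamma(n + 1)
        - sum(lgamma(xi + 1) for xi in x)
        + log_multivariate_beta([a + xi for a, xi in zip(alpha, x)])
        - log_multivariate_beta(alpha)
    )
    return exp(log_h)


def compositions(n, k):
    """All k-tuples of non-negative integers that sum to n."""
    if k == 1:
        yield (n,)
        return
    for first in range(n + 1):
        for rest in compositions(n - first, k - 1):
            yield (first,) + rest


ALPHA = (2.0, 1.0, 1.0, 0.5)  # hypothetical parameters, for illustration only
total = sum(mmb_pmf(x, ALPHA) for x in compositions(5, 4))
mean_x1 = sum(x[0] * mmb_pmf(x, ALPHA) for x in compositions(5, 4))
print(total, mean_x1)
```

Summing over every possible outcome for a small n is a quick check that the expression is a proper distribution, and the mean of the first count should equal n times α_1 divided by the sum of the α_i.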

It should be noted that (2.2) is a conglomerate distribution in that
it should account for the between trial variation in $x_j$ even if the
pollutant source were varied between trials and/or if there were substantial
measurement error and/or if trials could be classified into distinct met
regimes and/or if samplers were positioned at varying heights above ground
level. However, the more the sources of unidentified variation, the
greater the variability and the less the I.L., so that one should always
isolate sources of variation when possible.

3. INFORMATION CONTOURS AND OPTIMAL SAMPLE SIZE SELECTIONS. Before
addressing the questions in (1.1), we first quantify the change in I.L.
as the sample size is varied. Thereupon, an optimal (n,N) combination,
say $(n_0, N_0)$, is that which maximizes I.L. under constraints of fixed costs.

In this problem, I.L. could be taken as the inverse of the generalized
variance of $\hat{\alpha}$, the maximum likelihood estimate of $\alpha$; i.e., under
fairly general conditions, it is known that variance$(\hat{\alpha}) = \Phi^{-1}/N$, where

$$\Phi = -E\left(\partial^2 \log h / \partial\alpha_i\, \partial\alpha_r\right)$$

and h denotes the MMB distribution. I.L. is then estimated by $N|\Phi|$ after $\hat{\alpha}$
is substituted for $\alpha$. Alternatively, I.L. could be taken as the inverse
of variance$(t'x) = t'Vt$, where $V$ = variance$(x)$ and $t'x$ is an appropriate
linear combination of the $x_i$; e.g., the $t_i$ might be defined as the
midpoints of the four categories given in (2.1). As a matter of illustration,
we take the former as the measure of I.L.

Figure 1 contains contours of fixed I.L. = $N|\Phi(n, \hat{\alpha})|$ values,
ranging from $.3 \times 10^{-3}$ to $1000 \times 10^{-3}$ for varying values of n and N. If, for
example, n = 30 and N is increased from 6 to 10, the I.L. is increased,
roughly, from $5 \times 10^{-3}$ to $50 \times 10^{-3}$, a 900% increase in I.L. For this
application, this illustrates that N should be increased at the expense
of n and that large n adds little to I.L. when N is held constant; e.g.,
if the area of interest were saturated with samplers and trials were few
in number, then a great deal would be known about these few trials, but
little would be known about trials in general (which form a major source
of variation).

To maximize I.L. subject to fixed costs, we employ the simple cost
model

$$C = C_B + C_L + N C_T + nN C_S, \qquad (3.1)$$

where $C$ denotes total funds available; $C_B$ is the base cost; $C_L$ is the
expected loss in funds due to materialized risks; $C_S$ is the cost per
sampler and $C_T$ the cost per trial. Substitution of
$C = 200$, $C_B = 15$, $C_L = 5$, $C_T = 10$, and $C_S = 1/2$ into (3.1) yields

$$180 = 10N + nN/2. \qquad (3.2)$$

We could introduce a Lagrangian multiplier, $\lambda$, and differentiate
$N|\Phi| - \lambda(180 - 10N - nN/2)$ with respect to N, n, and $\lambda$ in determining
$(n_0, N_0)$. However, a graphical approach is the easiest recourse; i.e.,
superimposing (3.2) onto the contours in Figure 1, we choose $(n_0, N_0)$ as
that combination corresponding to the maximum value of I.L. on the curve
(3.2). From the plot of (3.2), given in Figure 1, it is seen that (20, 9)
is an adequate approximation to $(n_0, N_0)$.
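To make the budget constraint concrete, the sketch below solves (3.1) for n at each candidate N and lists the whole-number (n, N) combinations that lie exactly on the curve (3.2). The function name and the search range for N are assumptions made for illustration; the cost values are those quoted in the text. The sketch does not compute I.L. itself, which still has to be read from the contours in Figure 1.

```python
def samplers_for_budget(N, C=200.0, C_B=15.0, C_L=5.0, C_T=10.0, C_S=0.5):
    """Solve C = C_B + C_L + N*C_T + n*N*C_S for n, the number of samplers per trial."""
    return (C - C_B - C_L - N * C_T) / (N * C_S)

# Whole-number (n, N) combinations that exhaust the budget exactly.
feasible = []
for N in range(1, 18):  # at N = 18 the budget leaves no funds for samplers
    n = samplers_for_budget(N)
    if n >= 1 and n == int(n):
        feasible.append((int(n), N))
```

The combination (20, 9) chosen in the text is one of the points in `feasible`, since 10(9) + 20(9)/2 = 180.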

4. CONCLUDING REMARKS. Now we are in a position to answer the
questions in (1.1). Firstly, prior knowledge has been utilized in the
form of the depiction in Figure 1. Regarding the second question, an
I.L. of approximately $15 \times 10^{-3}$ is to be gained for a fixed expenditure of
C = 200. Relating I.L. to center of gravity (see Section 1) means that
I.L. = $15 \times 10^{-3}$ is to be interpreted on a relative basis. Had I.L. been
equated to the inverse of variance$(t'x)$ (as discussed in Section 3), absolute
interpretations could have been given; e.g., in this case, the expenditure
of C = 200 would give rise to an I.L. which, say, would lead to a
confidence interval of a certain width.

The third question in (1.1) is answered by varying the value of C
in (3.1) which, in turn, shifts the cost curve in Figure 1 up or down.
In this manner, one can determine the loss or gain in I.L. as the budget
is increased or decreased. If, for example, the budget is decreased to
the extent that I.L. is deemed insufficient to answer the experimental
objectives, thought should be given to whether the experiment should be
conducted.


APPENDIX 1

THE  MMB  DISTRIBUTION 


Let the multinomial distribution,

$$f(x; n, p) = n! \prod_{i=1}^{r} \frac{p_i^{x_i}}{x_i!}, \qquad E(x) = n p, \qquad \sum_{i=1}^{r} x_i = n, \qquad \sum_{i=1}^{r} p_i = 1,$$

describe the within trial variation of $x = (x_1, \ldots, x_r)'$. If $p$ is
constant between trials, then this distribution also accounts for the
between trial variation in $x$. However, $p$ may vary between trials, even
when environmental conditions are closely monitored and the scheduling
of trials is arranged such that these conditions are as homogeneous as
possible. Such variation in $p$ might be described by the multivariate
beta distribution,


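$$g(p; \alpha) = \prod_{i=1}^{r} p_i^{\alpha_i - 1} \Big/ \beta(\alpha), \qquad \alpha = (\alpha_1, \ldots, \alpha_r)' > 0,$$

where

$$\beta(\alpha) = \prod_{i=1}^{r} \Gamma(\alpha_i) \Big/ \Gamma\Big(\sum_{i=1}^{r} \alpha_i\Big).$$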

Compounding $f(x; n, p)$ and $g(p; \alpha)$ (see Feller (1957)) and letting r = 4 for
the application in Section 2, we have

$$h(x; n, \alpha) = \int_0^1 dp_1 \int_0^{1-p_1} dp_2 \int_0^{1-p_1-p_2} f(x; n, p)\, g(p; \alpha)\, dp_3, \qquad p_4 = 1 - p_1 - p_2 - p_3,$$

which, after integration, reduces to (2.2). This is an extension of a
result of Skellam (1948), who compounded the binomial and beta
distributions. From the general case (given by Mosimann (1962)), it follows that
the $(k_1, k_2, k_3)$-th factorial moment of $h(x; n, \alpha)$ for r = 4 is

$$\mu(k_1, k_2, k_3) = n^{(\sum_{i=1}^{3} k_i)}\, \beta(k_1 + \alpha_1,\ k_2 + \alpha_2,\ k_3 + \alpha_3,\ \alpha_4) \big/ \beta(\alpha_1, \alpha_2, \alpha_3, \alpha_4),$$

where $n^{(k)} = n(n-1)\cdots(n-k+1)$.

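Hence

$$E(x_i/n) = \alpha_i \Big/ \sum_{l=1}^{4} \alpha_l,$$

$$\mathrm{var}(x_i/n) = \frac{\alpha_i \left(\sum_{l=1}^{4} \alpha_l - \alpha_i\right)\left(n + \sum_{l=1}^{4} \alpha_l\right)}{\left(\sum_{l=1}^{4} \alpha_l\right)^2 \left(1 + \sum_{l=1}^{4} \alpha_l\right) n}, \qquad (A.1.1)$$

$$\mathrm{cov}(x_i/n,\ x_{i'}/n) = \frac{-\alpha_i \alpha_{i'} \left(\sum_{l=1}^{4} \alpha_l + n\right)}{\left(\sum_{l=1}^{4} \alpha_l\right)^2 \left(1 + \sum_{l=1}^{4} \alpha_l\right) n}.$$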
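The moment formulas in (A.1.1) can be checked numerically with a short sketch such as the one below. The function name is hypothetical; the parameter values are the final estimate reported in Appendix 2, with n = 35 samplers.

```python
def mmb_moments(alpha, n):
    """Mean vector and covariance matrix of x/n under the MMB distribution, per (A.1.1)."""
    A = sum(alpha)
    scale = (n + A) / ((1.0 + A) * n)  # factor common to every variance and covariance
    mean = [a / A for a in alpha]
    r = len(alpha)
    cov = [[(mean[i] * (1.0 - mean[i]) if i == k else -mean[i] * mean[k]) * scale
            for k in range(r)]
           for i in range(r)]
    return mean, cov

# Final maximum likelihood estimate quoted in Appendix 2.
alpha_hat = (7.90148, 4.45071, 3.53773, 3.04877)
mean, cov = mmb_moments(alpha_hat, 35)
```

Writing the variance as mean(1 - mean) times a common factor makes the comparison with the multinomial explicit: the factor (n + sum of the alphas) / ((1 + sum of the alphas) n) exceeds the multinomial value 1/n whenever n > 1.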


In summary, $h(x; n, \alpha)$ is intended to account for the between trial variation
in $x$ when $p$ varies between trials. In general, when a contingency table
with mixed borders (see Cramér (1946)) leads to a significant $\chi^2$ result and
when an alternative to the multinomial distribution is desired, application
of the MMB distribution is a natural recourse.
APPENDIX 2

MAXIMUM  LIKELIHOOD  ESTIMATION  OF  PARAMETERS  OF  THE  MMB  DISTRIBUTION 


In fitting the MMB distribution to the data in Table 1 through maximum
likelihood (m.l.) estimation, the log likelihood function is given by


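$$\log L = \text{constant} + \sum_{i=1}^{4} \sum_{j=1}^{22} \log \Gamma(\alpha_i + x_{ij}) - 22 \log \Gamma\Big(\sum_{i=1}^{4} \alpha_i + 35\Big) + 22 \log \Gamma\Big(\sum_{i=1}^{4} \alpha_i\Big) - 22 \sum_{i=1}^{4} \log \Gamma(\alpha_i).$$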


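The function $\log \Gamma(\theta)$ is approximated by

$$\tfrac{1}{2}\log(2\pi) + (\theta - \tfrac{1}{2})\log\theta - \theta + \frac{1}{12\theta} - \frac{1}{360\theta^3} + \frac{1}{1260\theta^5} - \frac{1}{1680\theta^7}$$

for $\theta \ge 5$. Repeated use of $\Gamma(\theta) = (\theta - 1)\Gamma(\theta - 1)$ is made when
$\theta < 5$ (see Carathéodory (1958), page 297).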
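The approximation just described can be sketched in a few lines and compared against a library log-gamma. The function name is hypothetical; the recurrence is applied in the equivalent form log Γ(θ) = log Γ(θ + 1) − log θ until the argument reaches 5.

```python
from math import log, pi, lgamma

def log_gamma_stirling(theta):
    """Stirling-series approximation to log Gamma(theta) for theta > 0."""
    if theta <= 0:
        raise ValueError("theta must be positive")
    shift = 0.0
    while theta < 5.0:
        shift -= log(theta)  # log Gamma(theta) = log Gamma(theta + 1) - log(theta)
        theta += 1.0
    series = (1.0 / (12.0 * theta) - 1.0 / (360.0 * theta ** 3)
              + 1.0 / (1260.0 * theta ** 5) - 1.0 / (1680.0 * theta ** 7))
    return shift + 0.5 * log(2.0 * pi) + (theta - 0.5) * log(theta) - theta + series

# Largest disagreement with the standard-library value over a few arguments.
worst = max(abs(log_gamma_stirling(v) - lgamma(v))
            for v in (0.5, 1.0, 3.04877, 7.90148, 53.9))
```

With the series truncated after the fourth correction term, the first omitted term is of order 1/(1188 θ⁹), so the error at θ = 5 is below 10⁻⁹.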

The m.l. estimate of $\alpha$, say $\hat{\alpha}$, is obtained by taking partials of log L,
equating $\partial \log L/\partial\alpha_i$ to zero, and solving for that value which maximizes
log L. To calculate $\hat{\alpha}$ by iterative methods, an initial value, say $\alpha^{(0)}$,
is required and can be obtained through method of moments estimation as
follows. From (A.1.1) we have

$$R_{1,i,i'} = E(x_i)/E(x_{i'}) = \alpha_i/\alpha_{i'}, \qquad (A.2.1)$$

$$R_{2,i,i'} = \{E(x_i^2) - E(x_i)\}/\{E(x_{i'}^2) - E(x_{i'})\} = (\alpha_i^2 + \alpha_i)/(\alpha_{i'}^2 + \alpha_{i'}).$$


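These two relations can be used in identifying four equations in the four
unknown $\alpha_i$ after

$$\hat{R}_{1,i,i'} = \sum_{j=1}^{22} x_{ij} \Big/ \sum_{j=1}^{22} x_{i'j}, \qquad \hat{R}_{2,i,i'} = \frac{\sum_{j=1}^{22} x_{ij}^2 - \sum_{j=1}^{22} x_{ij}}{\sum_{j=1}^{22} x_{i'j}^2 - \sum_{j=1}^{22} x_{i'j}} \qquad (A.2.2)$$

are substituted for $R_{1,i,i'}$ and $R_{2,i,i'}$.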
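One way to turn the two relations into a starting value, offered here only as an assumption about how the four equations might be chosen, is to pick a pair (i, i'), solve the two relations for $\alpha_{i'}$, and obtain the remaining parameters from the first relation. Substituting $\alpha_i = R_1 \alpha_{i'}$ into the second relation gives $\alpha_{i'} = (R_1 - R_2)/(R_2 - R_1^2)$. The helper names and the small data set below are hypothetical.

```python
def solve_pair(R1, R2):
    """Return (alpha_i, alpha_ref) solving R1 = a_i/a_ref and
    R2 = (a_i**2 + a_i) / (a_ref**2 + a_ref)."""
    alpha_ref = (R1 - R2) / (R2 - R1 ** 2)
    return R1 * alpha_ref, alpha_ref

def moment_start(X, i, ref):
    """Moment-based starting vector from count data X (rows are trials),
    using category i together with the reference category ref."""
    r = len(X[0])
    s1 = [sum(row[k] for row in X) for k in range(r)]       # sum over trials of x_kj
    s2 = [sum(row[k] ** 2 for row in X) for k in range(r)]  # sum over trials of x_kj squared
    R1 = s1[i] / s1[ref]
    R2 = (s2[i] - s1[i]) / (s2[ref] - s1[ref])
    _, alpha_ref = solve_pair(R1, R2)
    # Remaining parameters follow from the first relation, alpha_k = R1(k, ref) * alpha_ref.
    return [alpha_ref * s1[k] / s1[ref] for k in range(r)]

# Hypothetical data: 8 trials, 3 categories, 10 samplers per trial.
X = [(8, 1, 1), (2, 6, 2), (3, 3, 4), (1, 2, 7),
     (6, 3, 1), (4, 1, 5), (2, 5, 3), (7, 2, 1)]
start = moment_start(X, i=0, ref=2)
```

Different choices of the pair (i, i') generally give different starting vectors, and some can even be negative, so several candidates may need to be screened.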

Except for the case r = 2, a number of $\alpha^{(0)}$ values may have to be
considered, since repeated application has shown that many of these moment
estimators lie nowhere in the neighborhood of $\hat{\alpha}$, in which case
non-convergence or convergence to a relative maximum may result. Among several
possible values for $\alpha^{(0)}$, the appropriate one is that which minimizes

$$\sum_{i=1}^{r} H_i^2(0), \qquad \text{where } H_i(0) = \partial \log L/\partial\alpha_i \,\big|_{\alpha = \alpha^{(0)}}.$$

Additionally, it is anticipated that the MMB distribution, as it relates
to the data in Table 1, will have a unique mode, in which case $\alpha_i > 1$.
Using these criteria to evaluate differing values of $\alpha^{(0)}$ as defined
by (A.2.2), we choose $\alpha^{(0)}$ = (10.648, 5.993, 4.072, 3.804)'. For this
value, $H_1$ = -.262, $H_2$ = -.255, $H_3$ = -.608, $H_4$ = -.349, so that
$\alpha^{(0)}$ appears to lie in the neighborhood of $\hat{\alpha}$.

Convergence to $\hat{\alpha}$ utilizes a modified Newton-Raphson procedure as
follows. The correction to $\alpha^{(t)}$, the value obtained in the t-th iterative
cycle, is $c\,D^{-1}\delta$, where $D$ is an r x r matrix with typical element
$\partial^2 \log L/\partial\alpha_i\, \partial\alpha_{i'} \,|_{\alpha = \alpha^{(t)}}$, and $\delta$ is an r x 1 vector with
typical element $-H_i(t)$; c takes the values .1, .2, ..., 1.0 with the
selected value of $\alpha^{(t+1)}$ taken as that for which $\sum_{i=1}^{r} H_i^2(t+1)$ is minimized.

Values of $\alpha^{(t)}$ through six iterative cycles are as follows:

$\alpha^{(1)}$ = (7.73646, 4.35829, 3.46059, 2.95983)
$\alpha^{(2)}$ = (7.90812, 4.45444, 3.54099, 3.05283)
$\alpha^{(3)}$ = (7.89936, 4.44951, 3.53670, 3.04752)
$\alpha^{(4)}$ = (7.90126, 4.45056, 3.53759, 3.04854)
$\alpha^{(5)}$ = (7.90155, 4.45074, 3.53774, 3.04876)
$\alpha^{(6)}$ = (7.90148, 4.45071, 3.53773, 3.04877)
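The modified Newton-Raphson cycle can be sketched as follows. This is an illustration under stated assumptions rather than the routine used for the values above: the function names and the small data set are hypothetical, all trials are assumed to share the same n, and a step is accepted only if it lowers the sum of squared partials. For integer counts the differences of digamma and trigamma functions in the partials reduce to finite sums, so no special functions are needed.

```python
def gradient(alpha, X):
    """H_i = d log L / d alpha_i; rows of X are trials with a common total n."""
    A, N, n = sum(alpha), len(X), sum(X[0])
    common = N * sum(1.0 / (A + k) for k in range(n))
    return [sum(1.0 / (a + k) for row in X for k in range(row[i])) - common
            for i, a in enumerate(alpha)]

def hessian(alpha, X):
    """Matrix of second partials of log L."""
    A, N, n, r = sum(alpha), len(X), sum(X[0]), len(alpha)
    common = N * sum(1.0 / (A + k) ** 2 for k in range(n))
    D = [[common] * r for _ in range(r)]
    for i, a in enumerate(alpha):
        D[i][i] -= sum(1.0 / (a + k) ** 2 for row in X for k in range(row[i]))
    return D

def solve(M, b):
    """Solve M s = b by Gaussian elimination with partial pivoting."""
    r = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(M)]
    for col in range(r):
        piv = max(range(col, r), key=lambda k: abs(aug[k][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        for k in range(col + 1, r):
            f = aug[k][col] / aug[col][col]
            for m in range(col, r + 1):
                aug[k][m] -= f * aug[col][m]
    s = [0.0] * r
    for i in reversed(range(r)):
        s[i] = (aug[i][r] - sum(aug[i][m] * s[m] for m in range(i + 1, r))) / aug[i][i]
    return s

def sum_sq(alpha, X):
    """Sum of squared partials, the criterion minimized over c."""
    return sum(h * h for h in gradient(alpha, X))

def newton_fit(alpha, X, cycles=50):
    """Repeat corrections c * D^{-1} * delta with c in .1, .2, ..., 1.0."""
    for _ in range(cycles):
        delta = [-h for h in gradient(alpha, X)]
        step = solve(hessian(alpha, X), delta)
        candidates = [[a + c / 10.0 * s for a, s in zip(alpha, step)]
                      for c in range(1, 11)]
        candidates = [cand for cand in candidates if min(cand) > 0]
        if not candidates:
            break
        best = min(candidates, key=lambda cand: sum_sq(cand, X))
        if sum_sq(best, X) >= sum_sq(alpha, X):
            break  # no candidate improves the criterion
        alpha = best
    return alpha

# Hypothetical data: 8 trials, 3 categories, 10 samplers per trial.
X = [(8, 1, 1), (2, 6, 2), (3, 3, 4), (1, 2, 7),
     (6, 3, 1), (4, 1, 5), (2, 5, 3), (7, 2, 1)]
start = [1.0, 1.0, 1.0]  # arbitrary starting value for the illustration
fitted = newton_fit(start, X)
```

Choosing c from a grid rather than always taking the full step guards against overshooting when the starting value is far from the maximum.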

Substitution of $\hat{\alpha} = \alpha^{(6)}$ for $\alpha$ and 35 for n in (A.1.1) leads to
(.417, .235, .187, .161) as the estimate of $E[(1/n)x]$ and

.0187,  -.0075,  -.0052,  -.0060 
.0138,  -.0034,  -.0029 
.0117,  -.0023 
.0104 

as the estimate of variance$((1/n)x)$.
APPENDIX 3
GOODNESS  OF  FIT 

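Goodness of fit is illustrated through the distribution of

$$y_i = \sum_{i' \ne i} x_{i'},$$

where

$$h(y_i) = \frac{n!\; \beta\left(\sum_{i' \ne i} \alpha_{i'} + y_i,\ \alpha_i + n - y_i\right)}{y_i!\,(n - y_i)!\; \beta\left(\sum_{i' \ne i} \alpha_{i'},\ \alpha_i\right)}$$

and

$$f(y_i) = \frac{n!\,(1 - p_i)^{y_i}\, p_i^{\,n - y_i}}{y_i!\,(n - y_i)!}$$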

for the MMB and multinomial distributions, respectively. Table 2 presents
groupings of observed and expected $y_i$, $i = 1, \ldots, 4$, for f and h. Ranges
for each $y_i$ were selected for demonstrating the greater spread of the MMB
distribution, so that the goodness of fit statistic is applied loosely.
For, not only are most observed cell frequencies quite small, but in three
cases there are no degrees of freedom; i.e., for the MMB distribution, there
is a loss of five degrees of freedom, four for the estimation of the $\alpha_i$
and one due to the restriction that $\sum_i x_{ij} = 35$; for the multinomial
distribution, four degrees of freedom are lost, three for estimating $p$
and one due to $\sum_i x_{ij} = 35$.

Table 2. Goodness of fit of the MMB and Multinomial (M)
distributions to the data in Table 1.

Ranges                     Expected
for y1      Observed     MMB         M

0-11            1         .576       .036
12-17           5        5.378      3.997
18-21           5        7.386     10.855
22-26           8        7.069      6.831
27-30           2        1.501       .279
31-35           1         .112       .002

Total          22       22.000     22.000

χ² value                 8.476    538.042
Degrees of freedom           1          2

Ranges                     Expected
for y2      Observed     MMB         M

0-19            2         .785       .084
20-22           2        1.988      1.132
23-25           3        4.446      5.748
26-29           9        8.934     12.279
30-32           5        4.787      2.647
33-35           1        1.060       .110

Total          22       22.000     22.000

χ² value                 2.364     56.078
Degrees of freedom           1          2

Ranges                     Expected
for y3      Observed     MMB         M

0-21            1         .748       .061
22-24           4        2.044      1.028
25-28           5        7.199      9.677
29-32           9        9.596     10.598
33-35           3        2.413       .636

Total          22       22.000     22.000

χ² value                 2.809    343.340
Degrees of freedom           0          1

Ranges                     Expected
for y4      Observed     MMB         M

0-24            1        1.365       .234
25-29           8        7.627      9.156
30-33          11       10.830     12.063
34-35           2        2.178       .547

Total          22       22.000     22.000

χ² value                  .144      6.608
Degrees of freedom           0          0
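Expected frequencies of the kind shown in Table 2 can be sketched as below for $y_4$. The function names are hypothetical, and the multinomial proportion is taken as $\hat{\alpha}_4 / \sum \hat{\alpha}_i$ purely for illustration; the value of p used for the M column in the table is not stated in this section and may differ, so the sketch is not expected to reproduce that column exactly.

```python
from math import lgamma, exp, log

def log_beta2(a, b):
    """Log of the ordinary (two-argument) beta function."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def log_choose(n, y):
    return lgamma(n + 1) - lgamma(y + 1) - lgamma(n - y + 1)

def h_marginal(y, n, a_rest, a_i):
    """MMB probability that y of the n samplers fall outside category i."""
    return exp(log_choose(n, y)
               + log_beta2(a_rest + y, a_i + n - y) - log_beta2(a_rest, a_i))

def f_marginal(y, n, p_i):
    """Multinomial (binomial) probability of the same event."""
    return exp(log_choose(n, y) + y * log(1.0 - p_i) + (n - y) * log(p_i))

def expected(pmf, ranges, trials=22):
    """Expected number of trials whose y falls in each inclusive range."""
    return [trials * sum(pmf(y) for y in range(lo, hi + 1)) for lo, hi in ranges]

alpha_hat = (7.90148, 4.45071, 3.53773, 3.04877)  # estimate from Appendix 2
A, n = sum(alpha_hat), 35
a4 = alpha_hat[3]
ranges_y4 = [(0, 24), (25, 29), (30, 33), (34, 35)]

mmb = expected(lambda y: h_marginal(y, n, A - a4, a4), ranges_y4)
mult = expected(lambda y: f_marginal(y, n, a4 / A), ranges_y4)  # assumed p_4
```

Because the ranges cover 0 through 35, each list of expected frequencies sums to the 22 trials, and the MMB places more mass in the extreme ranges than the multinomial with the same mean.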